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because they relate to subject matter not required to be searched by this Authority, namely: 

Although claims 16, 19 and 22 are directed to a method of treatment of the 
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3. | | Claims Nos.: 
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' ■ searchable claims. 
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1. Claims: 1-23 (partially) 

An isolated polypeptide having sequence as set forth in 
Seq.Id.No.l of the sequence listing and its nucleic acid 
sequence as set forth in Seq. Id. No. 23. Methods of production 
and uses thereof. 



2. Claims: 1-23 (partially) 
Inventions 2-22: 

Same as for invention no.l but respectively to each 
following pair of aminoacid sequences with their respective 
nucleic acid sequences: 

Invention 2: Seq.Id.Nos.2 and 24. 

Invention 3: Seq.Id.Nos.3 and 25. 



Invention 22: Seq. Id Nos.22 and 44. 



INTERNATIONAL SEARCH REPORT 

International Application No 
information on patent family members ^^-^^^ 

^ PC^B 00/05621 



nationon 



Patent document ^^^ubNcation Patent family Publication 

cited in search report date member(s) date 

WO 9922243 A 06-05-1999 AU 1273499 A 17-05-1999 

EP 1042674 A 11-10-2000 

AU 1118499 A 17-05-1999 

W0 9921575 A 06-05-1999 



WO 


9830696 


A 


16- 


-07- 


1998 


us 


5945302 


A 


31-08-1999 














1 IC 


CQ70 C C O 


A 

A 


0£ in 1 QQQ 














AU 


2723397 


A 


29-10-1997 














EP 


0907732 


A 


14-04-1999 














EP 


1007660 


A 


14-06-2000 














^ d 


cUUUOUOl / c. 


T 
1 


PlZL_fl7 OC\QC\ 
UH - VJ/ - cUuU 














AU 


5822398 


A 


03-08-1998 














AU 


2558599 


A 


26-07-1999 














EP 


1047713 


A 


02-11-2000 














WO 


9935168 


A 


15-07-1999 


WO 


9832853 


A 


30- 


•07- 


1998 


AU 


5828398 


A 


18-08-1998 














EP 


1007661 


A 


14-06-2000 


wo 


9840486 


A 


i7- 


■oy- 


iyy« 


US 


5976837 


A 


02-11-1999 














AU 


6702298 


A 


29-09-1998 














EP 


0973890 


A 


26-01-2000 



INTERNATIONAL 



CH REPORT 



Intcmat^M^LpplicaUon No. 
PCT/l4^B2376 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :Please Sec Extra Sheet 

US CL :Please See Extra Sheet. 
According to International Patent Classification (IPC) or to both national classification and IPC 



R FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536/23.5, 23.1; 435/320.1, 440, 252.3, 69.1, 7.1; 530/350. 387.1; 514/12 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please Sec Extra Sheet 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document with indication, where appropriate, of the relevant passages 



ADAMS et al. Complementary DNA sequencing: Expressed 
sequence tags and the human genome project. Science. 21 June 
1991, Vol. 252, pages 1651-1656, see entire document. 



Relevant to claim No. 



1-22 



| | Further documents are listed in the continuation of Box C. | | Sec patent family annex. 



* Special categories of cited documents: 

■A" document defining the general state of the art which is not considered 

to be of particular relevance 

"E" earlier document published on or after the international filing date 

"L" document which may throw doubt* on priority claim(s) or which is 

cited to establish the publication date of another citation or other 
special reason (as specified) 

"O* document referring to an oral disclosure, use, exhibition or other 

means 

"P" document published prior to the international filing date but later than 
the priority date claimed 



later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance, the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 



13 JANUARY 1999 



Date of mailing of the international search report 

03 FEB 1999 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 2023 1 
Facsimile No. (703) 305-3230 



Authorized 

JAMr/sMARTrNELL 
Telephone No. (703) 308-0196 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This intcmationai report has not been established in respect of certain claims under Article 17(2Xa) for the following reasons: 
Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



1. Claims Nos 



2. r7r\ Claims Nos.: 23 

' ' because they relate to parts of the international application that do not comply with the prescribed requirements to such 

an extent that no meaningful international search can be carried out, specifically: 

Claim 23 is directed to a product of the process of claim 20. claim 20 is not a process for the production of a product, 
but a process for the detection of a substance, hence, no meaningful search can be carried out. 

3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



** I I As ai * re< l uircd addiuonai search fees were timely paid by the applicant, this international search report covers all searchable 



claims. 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. | | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

G01N 33/68. 33/53; C07K 16/00; C12N 15/11, 15/12, 15/00, 15/63; A61K 38/17, 38/16; C12P 21/02 

A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 

536/23.5, 23.1; 435/320.1. 440, 252.3, 69.1, 7.1; 530/350. 387.1; 514/12 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

APS. STN, MPSRCH (SEQ ID NOs 1 1 and 160 only). One nucleotide sequence and one amono acid sequence have 
been searched. It is not clear which sequences are embraced by the claims because the claims refer to sequences X and 
Y. The table beginning after page 209 contains many sequences X and Y, yet the claims refer to X and Y in the 
singular. If the claims are to embrace more than one X and more than one Y, it is not clear whether each X always 
requires the corresponding sequence Y. Additionally, the claims are in improper format in referring to the description 
(see PCT Rule 6.2(a)). Accordingly, the first X nucleotide sequence disclosed and the first Y amino acid sequence 
mentioned in the claims were searched. 



Form PCT/ISA/210 (extra sheetXJuly 1992)* 

BNSDOCID ,WO 9922243A1 I > 
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PATENT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 1 8 and Rules 43 and 44) 



INCYTE PHARMACEUTICALS , INC. et al . 



^KSK.fTj ^^^^^^^ ^ ~d ,o the applicant 



This Internationa) Search Report consists of a total of 5_ 



Si'itfcfis. 



• . — . IVCIO. 

It is also accompanied by a copy of each prior art document cited in this report. 



1 . Basis of the report 

a ss: r c e h rrasr^^r^ basis of the in — ' ,„ 

D Z^TJe a 2l e S WaS ° Ut °" the basis «* * tranrtiflon of the international application furnished to this 

b " ^ cS E?£2*5 S^^JSlr"^ diSC ' OSed < h * — iona, application, the international search 

|_Aj contained in the international application in written form. 
□ filed together with the international application in computer readable form, 
furn.shed subsequently to this Authority in written form, 
furnished subsequently to this Authority in computer readble form. 



□ 



MME" MqUenCB ^ ^ ™ °° ^ th. disclosure in the 
th^atemen, that the .formation recorded in computer readable form is identical to the written sequence listing has been 



2- (XJ Certain claims were found unsearchable (See Box I). 

3- fX[ Unity of invention is lacking (see Box II). 

4. With regard to the title, 

[X] the text is approved as submitted by the applicant. 

□ the ^xt has been established by this Authority to read as follows: 



5. With regard to the abstract, 

P[] the text is approved as submitted by the applicant. 

[— 1 the text has been established, according to Rule 38 2(b) hv thic a, ,th„ ■♦ 

* one month from the date of « ffi^ l!l tIMtt 1 may - 

6. The figure of the drawings to be published with the abstract ,s Figure No. 



"1 



because the applicant failed to suggest a figure, 
because this figure better characterizes the invention. 



None of the figures. 
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d 0 2 APR 2002 



vvipo 



PCT 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 
(PCT Article 36 and Rule 70) 



Applicant's or agent's tile reference 
PF-0675 PCT 



International application No. 
PCT USOO, 05621 



See Notification of Transmittal of International 
FOR FURTHER AC 1 ION preliminary Examination Report < Form PCT IPEA 416> 



International filing date (Jay /month/year) 
03 March 2000 (03.03.2CXX)) 



Priority date (day -month, xean 
05 March 1999 (05.03 1999) 



International Patent Classification dPO or national classification and IPC 



IPC(7): C12N 1 68; G01N 33 53 and US CI 435 6, 69.1, 320.1. 536 23 1; 530 3(X); 424 184 
Applicant 



INCYTE PHARMACEUTICALS. INC 



Tins international prelmnnary examination report lias been prepared by tins International Preliminary 
Examining Authority and is transmitted to the applicant according to Article 36. 



ei Micci. 



This REPORT consists of a total ot^_ sheets, including tins covi 

| | This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings 
winch have been amended and are the basis for this report and/or sheets containing rectifications made 
before tins Authority (see Rule 70. 16 and Section 607 of the Administrative Instructions under the PCT). 



lese aimexes consist of a total 



\ 

VI 
VI 



otO 



sheets. 



ns report contains indications relating to the following items: 

[ • ] Basis of the report 
| | Priority 

P~] Non-establishment of report with regard to novelty, inventive step and industrial applicability 
1 1 Lack of unir\ of invention 

[~] Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability: citations and explanations supporting such statement 

j ] Certaui documents cited 

| | Certam defects in the international application 
[""■"] Certaui observations on the international application 



Date of submission of the demand 

26 September 2(Xn i i 26. 09. 2000 . 



Date of completion of tins report 
Fehnian 2(X>: t 2"? .02.200:. 



\:i!Tie ;>::d ni;ii: 



dres 



the 1PF A I S 



Facsimile No (■"= 305 32 3*> 




Form PCT IPEA ^'^uucr *diee: ■ Ji;a : 9^ 



I International apj^^^n No 
INTERNATIONAL PRILIMIN-^^PvAMINATION RKPORT , pcj fS(X) 0561 



I. Basis of the report 



Wiih regard to the elements of the intemauon:tJ application. * 

["•"] tlie international application as onenialh tiled. 

[ ' ] the description: 

panes 1-74 as originally filed 

pages NONE _. filed with the demand 

pages NONE , hied with the letter ot _ 

[ • ] the claims: 

pages 75-77 1 as originally filed 

pages NQNE , as amended (together with any statement) under Article IV 

pages NONE , hied with the demand 

paees NONE , tiled with the letter of 

| 1 the drawings: 

pages NONE . as originally tiled 

pages NONE . tiled with the demand 

pages NONE filed with the letter ot 

P~] the sequence listing pan of the description: 
paees 1-28 as origmallv tiled 

panes NONE ~ , tiled with the demand 
pages NONE , filed with the letter of 



pages inwi^c. * * 

- Wtth repanfto the language, all d* elements .narked above were available or furnished to tins Authority .n the 
"' language in winch the international apphcat.on was filed, unless otherwise indicated under tins item. 
These elements were avaUable or furnished to tins Authority in the following language which is. 

□ the language of a translation furnished for the purposes of international search (under Rule23. 1 (b)>. 
| | ,he language of publication of the international application (under Rule 48.3(b)). 

□ die language of the translation furnished for the purposes of international preliminary exammanon(under Rules 
55.2 and/or 55.3). 

3 With regard to anv nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was earned out on the basis ot the sequence listing: 
H\] contained in the international application in printed form 

filed together with die international application in computer readable form. 
! | furnished subsequently to this Authority in written fonn 
| | furnished subsequently to this Authority in computer readable fonn. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed lias been furnished 

□ The statement that the information recorded ... computer readable fonn ,s identical to the written sequence listing 

has been furnished 

4 r ~\ The amendments have resulted m the cancellation ot 

| | the description, pages NONE 

| | the claims, Nos. NONE 

! | the drawings, sheets 6g NONE 

5 n This report has been established as it , some of - the amendments lud not been made, since they liave been considered to go 

hevnnd the disclosure as filed, as indited m the Supplemental Box .Ride 7l).2.o. ** 
♦ ^placement sheets ukuk >n,e Ivcn Crushed to the r t ,r<uw OHue in response to an mutation under Ar J^ 7 ^^- 
, r ', n )r , . v .. >r , it , r , }h . , T/( , ;n .j ;r „ „, ; Mim - w d!--!h^ report wm ,< thex do tint contain amendments (Rules 70 IfianJ 7o.l ■) 
< • Anx'n'nll emen! dieet 'amniinm^ uch amendments must he referred to under ucin I and annexed to tins report 



Form POT 1PEA 4<N i Bo\ \ Jul\ 19S>s 
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International appb^faon No 
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PCT USOO 056?,^. | 
. - 1 



III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicabiliu 



The question wh ether the claimed invention appears to he novel, to involve an inventive step (to he non-obwous>. or 
to he industrial^ applicable have not been and will not be examined m respect ot: 

[ | the entire international application, 

[X] claims Nos. Chums 1-23 m pari, as thc \ read on SEQ ID NO 2 22. : 4-4-1 
because: 

□ the said international application, or die said claim Nos. relate to the following subject matter wliich does 

not require international preliminary examination (specify): 



□ the description, claims or drawings (indicate particular elements below or said claims Nos. are so unclear) 

tliat no meaning till opinion could be formed (speci fy } : 



□ the claims, or said claims Nos __ _ are so inadequately supported bv the description that no meaningful 
opinion could be formed. 

[Vl no international search report has been established tor said claims Nos ; 23 n: ran. ShQ 2 22. 24-44 



- \ meaningful international preliminary examination cannot be carried out due lo the failure of the nucleotide and or amino acid 
sequence hsung to comph with ilie standard provided lor m Annex C of the Administrative Instructions: 
| | aie written form has not been furnished or does not comply vwih the standard 

[<] the compiuer readable form has not been mrmshed or docs not eomph with the standard 



Form PCT 1PEA 409 . Box III ■ .Jul\ 1 , 



II 



INTERNATIONAL PRKUM1V 



AMINA HON RE FORT 



International upy. 
PCT L'SOO OScv 



T; Reasoned statement under Rule 66.2(a)(ii) ^ithT^d to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



STATEMENT 

Novelty (N) 

Inventive Step ( IS) 

Industrial Applicability (1A) 



Claims NONE 



Claims 1-23 

Claims NONE 
Claims 103 



YES 
NO 



YES 
NO 



Claims 1-23 



Claims NONE 



YES 
NO 



2 CITATIONS AND EXPLANATIONS 

^he sequences disclosed in the priority document and the related computer readable format are not the same as the 
sequences in the application, and are not in the same order as the sequences in this application. Therefore, the 
application is not entitled to the to me prionty date. 



UOLUHieilLb SJLCU. vvw z> j i \->, <^--< •>-•••• — ... ■-, 



Claims 1-" lack novelty under PCT Article 33,2, as being anticipated by WO 99 22243 .HUMAN GENOME SCIENCES i 06 May 
1999 This publication 'discloses SEQ ID NO: 215 having identity' to SEQ ID NO: 1 and/or 23. Polypeptides, fragments, and _ac ivc 
trails are disclosed, as are polynucleotides, vectors, transformed host cells. Methods of making and usmg me polynucleotide 

and or polypeptide are disclosed. 

Claims 1-23 meet die criteria set out in PCT Article 33«4t tor industrial applicability. 
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PCT L'S(K) 05621 



VIII. Certain observations on the international application 



The following observations on the clarify of the claims, description, and drawings or on the questions whether the claims | 
are fully supported by the description, are made: 

The description is objected to under PCT Rule 66.2ia>< v, as lacking clarity under PCT Article 5 because u tails to adequately enable 
practice of the clauned invention because: no biological activity has been ascribed to die polypeptide oi SEQ ID NO: 1 In Table 2, 
only a predicted signal sequence is identified. Table 3 discloses that short fragments of SEQ ID NO: 23 appear to be expressed in 
musculoskeletal tissue and reproductive tissue, and indicates general categories of potentially related diseases. No spec it ic data 
regarding the claimed sequences and am parucular biological activity is disclosed. Lacking the knowledge ot a particular biological 
activity, one would not be able to make or identify biologically active fragments of die polypeptide ot SEQ ID NO 1 Similarly , the 
description fails to adequately enable methods of diagnosing and treaung a disease If die specific biological activm . and its 
relationship in a named disease or medical condition is not described, one of skill in the an would not be able to praciice die claimed 
medio ds. 

Claims 1-23 are objected to as lacking clarity under PCT Rule 66.2ia)i v, because practice of the clauned invention is not enabled as 
required under PCT Ride 5. leu for the reasons set forth in die immediately preceding paragraph. 
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201 1 South Clark Place Room 
CP2/5C24 
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ETATS-UNIS D'AMERIQUE 

in its capacity as elected Office 


Date of mailing (day month year) 

04 May 2001 (04.05.01) 




International application No. 

PCT/US00/05621 


Applicant's or agent's file reference 

PF-0675 PCT 


International filing date (day/month year) 

03 March 2000 (03.03.00) 


Priority date (day/month/year) 

05 March 1999 (05:03.99) 


Add! tcant 

TANG, Y., Tom et al 



1. The designated Office is hereby notified of its election made: 



X in the demand filed with the International Preliminary Examining Authority on: 

26 September 2000 (26.09.00) 

| | in a notice effecting later election filed with the International Bureau on: 



2. The election | X | 

□ 



made before the expiration of 1 9 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 
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Secretejd and Membrane Proteins 
hal Seq jence Trap Method 

TAfeHIRO, Tomoyuki Nakamura, Nelson D. Lopez, 

TOSHIHIKO Wo, TORU NAKANO, 2 AND TASUKU HONJO 3 



We recently described a method, called the j lignal 
sequence trap (SST) method, to clone cDNAs o# ^se- 
creted proteins and/or type I transmembrane pij >teins 
containing N- terminal signal sequences by usji ag!an 
epitope- tagging expression plasmid vector. In ti is pa- 
per we describe the summary of a large-scale si rrieen- 
incr of ftonroxfmAtAlv $900 clones of nn SST cDNA? li- 
brary constructed firom mouse bone marrow si rbmal 
cell line ST-2 cells. Of 26 positive clones obtained! and 
sequenced, 11 clones appeared to contain autoeritic 
signal sequences. Five of the clones correspon te^ to 
the 5' ends of the cDNA of known genes contain 1 Ln 1 ^ N- 
terminal signal sequences. The full-length j cDNA 
clones of the 6 other unknown clones were >lated 
and sequenced. One clone, termed SDF3, encoded a 
mouse counterpart of human pigment epit belli j» -de- 
rived factor. Another clone, termed SDR1, had Consid- 
erable homology with basigin, a member of the { nbbu- 
noglobulin superfamily. A third clone, termed SRF5, 
had partial homology with a Drosophila tissue p< larity 
gene frizzled {ft) and its rat homologues, fz-1 ai dj/k-2. 
The other three clones had no significant hoi lology 
with sequences in the databases. These result t indi- 
cate that the SST method is effective and useful I or fche 
isolation of secreted and membrane proteins vi ithbut 
knowledge of their functions, o i»w Academic iv— j pin j 



INTRODUCTION 



The elucidation of molecular mechanisms foi 
cellular signaling and cell adhesion is essential 



The nucleotide sequences reported in this paper have 1 
ited with the GenBank/EMBL Data Libraries under / 
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ders ending the mechanism for the development and 
duTe rentiation of multicellular organisms. Although a 
large number of molecules involved in signaling or ad- 
hesion have been cloned, there still remain many un- 
known molecules that are important for these func- 
tions. Most of the molecules involved in signaling or 
adhesion are secreted or membrane- anchored proteins, 
suetj as growth factors, honnunes, neuropeptides, and 
their receptors or adhesion molecules. Many of these 
proteins contain hydrophobic signal sequences at the 
N;-te rmini of the precursor forms. To establish a gen- 
eral; cDNA cloning strategy for these secreted or mem- 
bra* e-anchored proteins, we developed a new cloning 
met lod for the isolation of cDNA fragments encoding 
N^tcrminal signal sequences (Tashiro et al. w 1993). 
cDMA fragments encoding signal sequences were de- 
tected by surface expression of the IL-2 receptor a chain 
(Tac) (Uchiyama et al., 1981) as fusion proteins in 
tr ari sfected cells. Using this method, termed the signal 
sequence trap (SST) method, we have so far reported 
tne : isolation of cDNA for a member of the a-chemo- 
kines, SDFla/£, and a growth factor receptor lympho- 
tbxii 0 receptor, by the SST method (Tashiro et al., 
1&9S; Nakamura et al., 1995), suggesting that this 
method may be widely useful. 

e hematopoietic system is one of the best charac- 
terized organs for analyzing the mechanisms of mam- 
malian cell growth and differentiation. Bone marrow 
al cell lines support hematopoiesis from bone 
w stem cells by secretion of cytokines and by in- 
tion with cell-adhesion molecules. Many mole- 

involved in cell -cell interactions during lympho- 

hematopoiesis have been cloned and are known to be 
expressed in bone marrow stromal cell lines (Metcalf, 
1&89; Paul et al., 1991). However, there are still a num- 
f unknown surface molecules involved in the early 
es of hematopoiesis. We therefore performed a 
scale screening of cDNAs encoding secreted and 
smembrane proteins firom a bone marrow stromal 
line ST-2 (Nishikawa et al, 1988) using the SST 
odL Isolated cDNAs will be expressed as proteins 
their functions could be tested in an inductive sys- 
tenJof lymphohematopoiesis from embryonic stem cells 
(friano et al., 1994). 
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Summary of t i* trapped < DNA Clones of SST 



library 



Clone 



Homology 

ORF in j 

frame to Known N -terrain* 
ATAC sequences SS 



Library 1 



Library II < 



HI 


+ 


two 




H3 


+ 


H4 


+ 


SI 


+ 


S2 


+ 


S3 


+ 


S4 


+ 


S6a 


+ 


Tl 


+ 


T2 


+ 


T4 


+ 


T5 


+ 


T7 


+ 


T8 


+ 


E6a 


+ 


F7a 


+ 


NT1 


+ 


NT2 


+ 


NT3 


+ 


NT4 


+ 


NTS 


+ 


NT6 




P5c 




R5g 


+ 


TA 


+ 



* T, true; F, false; U, undecided. 



We will summarize the results of the la 
screening of the SST library of ST-2 cells to 
the ratio of clones with the authentic signal 
to artifacts and to test if this strategy is v; 
general cloning method for isolation of signed i 
containing molecules. Ten clones with autheri 
sequences were obtained from approximately j 5900 
cDNA clones. The six full-length cDNA clones! 
novel proteins were isolated, sequenced, and 
ized. 



MATERIALS AND METHODS 



Construction and screening of the SST library, Co: 
the SST library was performed as described previously 
aL, 1993) except that two sets of sonicated cDNA ~ 
two size ranges (one between 300 and 500 bp and the 
600 and 700 bp) were selected and used as inserts of the 
About 10,000 clones of the shorter insert cDNA 
were plated on LB agar plates, and colony hybridizatk i 
formed with the probes of da (Ikeda et al., 1991), JE (It 
1988), and SDF1 according to the conventional method 
et aL, 1989). Colonies that did not hybridize with any 
were picked up and spotted on 9-cm agar plates with a 
(7 rows by 7 lines) (Tashiro et aL, in press), 
preselected colonies were screened after 49 clones wet 



mi irixtfbr 
ApprarLn atery 



SHIROZU AL. 

! \ TABLE 

i i 



[ | Hydrophobic 
peak 



: f 



Short ; 



+ 
+ 



+ 

Short 



Short 



ge-scale 
xrimine 
equence 
as a 
iiience- 
_ signal 

dcoding 
character - 



>ni ruition < 




Stop codon 
upstream of 
initiation 
codon 



Located at the 
5' end of full- 
length cDNA 



Signal 
sequence* 



Name of full- 
length clone 



+ 
+ 



+ 
+ 



T 
F 
U 
F 



T 
T 
U 
F 
T 
T 
U 
F 
T 
F 
U 
T 

F 
U 
T 
T 
F 
T 
F 
F 
F 
T 



Osteonectin 



Peptidyl-glycine 
a -ami dating 
monooxygenase 

SDR1 

SDF3 



SDR2 
SDF2 



Ribophorin 1 

Protein disulfide 
isomerase 



Fibronectin 
SDF4 

SDF5 



Osteonectin 



de&cribed before (Tashiro et ai., 1993). The longer insert library (li- 
br iry II) was screened without preselection. 

WA sequencing. DNA sequencing was carried out using double- 
st anded templates by the dideoxy method using the Tag DyeDeoxy 
te minator cycle sequencing kit and an automated DNA sequencer 
, (3 3A, Applied Biosys terns). The sequence information was used for 
; ai alysia of hydrophobicity and compared with sequences in the Gen- 
Bi ok database for homology using BLAST and FAST A. 

WA extraction and Northern blot analysis. Crude RNAs of 
mi use tissues were extracted by using an RNA extraction kit (TKl* 
Zi L, GIBCO BRL), and poly(A)" RNAs were enriched by oligo(dT) 
la ex (015gotex(dT) Super, Takara). Two micrograms of mouse tissue 
pc y(A)* RNA was electrophoresed through a 1.0% agarose gel and 
tr insferred to a nylon membrane (BIODYNE, Pall) in lOx SSC (lx 
3i C is 0.15 M Nad, 15 mAf sodium citrate) overnight, and cross- 
In ked with a UV crosshnker (Funahnker, Funakoshi). The mem- 
bl im was serially used for hybridization with the SDF3, SDF4, 
S3 >Fo, SDR1, SDR2, and mouse £-actin probes radiolabeled by the 
re adom priming method (Sambrook et al, 1989). The probes for the 
m w genes were full-length cDNAs. Hybridization was performed 
at wording to a conventional method, and washing was performed 
ui der stringent conditions (Sambrook et at, 1989). Autoradiograms 
w$re analysed with a Bio-image analyzer (BAS 2000, Fuji Film). 

RESULTS 

cj )NA Library Screening 

In the previous study, a small-scale screening (ap- 
p oximately 600 clones) of the SST library with 300- to 
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APPLICATION OF 



Hydropathy profile Hydrophobic peak Hydropathy profit 
^ * *o .1" -no 'M> ^ tfl i*B lU B n« 



Bydrophi bid rfeak 



SI 



-tio- 

■ 120- 



S2 



S3 



S4 



S6a 



T1 



C+) 



( + > 



T7'" 



-t«0 



F7a 



t-) 



RSg 



FIG. 1. (A) Hydropathy profiles of the signal sequen£ 
identical to known genes or carrying short ORF* (fewer 
a hydrophobic peak. (B) Hydropathy profiles of the full 
computer program based on Kyte and Doolittle (1982). 



500-bp inserts (Tashiro et al., 1993) led to the i 

isolation of three clones, Gla, JE, and SDF1. Tr> Tidlude 
such abundant clones two strategies were inti Sliced: 
(a) elimination of the abundant clones by colony^ hybrid- 
ization (library I) and (b) construction of an SS^JUbrary 



Summary of Characterizat? in 



St&NAL, SEQl "ENCE TRAP METHOD 



B 



A 
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SOqi (KT3) 



I 320 
160 
0 

160 
320 



SPy (KT5) 

s 



r » » If 1 * 1 If 





pped clone 
amino acic 
cDNA of 



Hydropathy profiles of the 14 trapped clones excluding those 
residues) are shown. (+) or (— ) indicates a clone with or without 
identified clones. Hydropathy profiles were analyzed with a 



n< wly 



ted wi h longer inserts (library II), which should remove 



TABLE : 

of the cDfcA Clones Obtained by ST-2 SST 



th4 three clones mentioned above because their coding 
sei [uences are less than 500 bp. The two strategies are 
coi nplementary to some extent, because the former is 
te< ious but does not exclude other shorter cDNA clones 



cDNA length ORF length 



Clone name 


(kb) 


(aa) 


SDF1 


3.4 


93 




1.8 


89 


SDF2 


1.0 


211 


SDF3 


1.4 


417 


SDF4 


1.8 


366 


SDF5 


1.8 


296 


SDR1 


1.9 


281 


SDR2 


2.4 


592 



Hot alary 



Sizes of 
mRNACkb) 



Tissue distribution of mRNA 




a-Chernoki: b 
Yeast Pmtj p a{id Pmt2p 

Pigment e{ 

derived 
4 EF 
Droaoj 

fritiUd-1 
Ig superfi 

chicken 



3.4, 1.8 Ubiquitous (peripheral blood 

leukocyte—) 
1.0 Ubiquitous 

1.6 Liver++, brain +, heart+, 

muscle +, spleen +, thymus + 
5.0, 1.8 Ubiquitous 
4.0, 2.2, L5 Heart+, kidney+, lung+, 

brain ±, thymus ± 
2.6, 2.1 2.1 kb, Ubiquitous 

2.6 kb, brain-specific 
6.8, 2.6 Kidney +, liver ±, spleen ±, 
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A 

m£OT3 



CXDM 
PDT 




mnaa 

CBT7 

FIG. 2. Comparison of the amino acid sequences oflplj)!^, SDF5, anjd 
of the amino acid sequences of SDP3 and human pi, 
of SDF5 and Drosophila fz and its homologues, (rat 
and chicken HT7. The putative transmembrane domains 
Human; m, mouse; r, rat; d r Drosophila; c, chicken, 



gm< nt 



while the latter might miss clones with ope a ; leading 
frames (ORFs) shorter than 500 bp 

About 3500 clones of library I were screenejc as de- 
scribed before (Tashiro et aL, 1993) and 15 
clones were obtained. On the other hand, 11 ,, 
clones were obtained by screening about 2400 c 



SDRl with their homologues or related proteins. (A) Compariso 
jpithelium-^erived factor (PEDF). CB) Comparison of the amino acid sequence 
Comparison of the ammo acid sequences of SDRl, mouse basigii 
The identical amino acid residues are indicated by open boxes. I 



iahd fz-2). (C> 
are underlined I 



positive 
jositive 
tones of 



library II. The ratios of positive clones to artij afci clones 



were similar between the two libraries and 



26 clones isolated will be characterized bel6w. ! 

: . j 

Sequence Analysis of Positive SST cDNA CUvies 

The nucleotide and amino acid sequences of [the in- 
serts of the 26 positive clones were 
compared with sequences in the GenBank 
homology. As summarized in Table 1, 
clones identical to partial sequences of know^i 
follows: HI and TA, osteonectin precursor 
1988); H4, peptidyl-glycine a-amidating moj* 
ase precursor (Eipper et aL, 1992); T5, 
precursor (rough endoplasmic reticulum 
hal et aL, 1990); E6a, protein disulfide 



deterti lined 
daUfcse 

the:-ej 
>wut 
(Hpfje 
no: 3i 
ribo; 
pre te] 



isom»rj 



ali of the 




cursor (rough endoplasmic reticulum protein) (Gong e 
i iL, 1988); NT2, fibronectin precursor (Schwarzbauer e 
i iL, 1987). Two signal sequence-trapped clones, HI ant 
' TA, corresponded to the 5' portions of osteonectin pre 
i tursor cDNA, but the inserts of the two clones were no 
dentdcal because they were derived from two distinc 
ibraries. The hydropathy profiles of 14 clones an 
i ihown in Fig. 1A, whereas those of the other 12 clone 
: dentical to known genes or carrying no or very shor 
< }RFs (fewer than 35 amino acid residues) are excluded 
Structural examination of the 26 trapped cDN^ 
;lones did not allow us to conclude that the following 
L5 clones have authentic signal sequences. Threi 
Jones, H2, NT6, and P5c, had no ORFs in frame witl 
he Tac cDNA sequence of the cloning vector. H4 wai 
dentical to part of a known gene, but it did not corre 
tpond to the N- terminal signal sequence (data no 
ihown). S4, T4, T7, and NT4 had no obvious hydropho 
)ic peaks following the putative first methionine resi 
lues. H3, T2, and T8 had very short ORFs (fewer thai 
{5 aa) though their sequences had hydrophobic proper 
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0.24 



FIG. ». Tissue distribution of SDF3, SDF4, SDFS, SDR1, ai x 
RNA from heart, brain, kidney, liver, lung, spleen, and thymus 
(C), SDR1 (D), SDR2 (E), or 0-actin (F) cDNA. The filter was*. 



I SDR2 mRNAs. The same Northern blot «ter^te.nmg 2 of ^A) 
frislwas hybridized serially with a DNA probe of SDFS (A). SDF4 (B). SDF5 
™" shed after each exposure. Size marker- are indicated m blobases. 



ties (data not shown), and therefore it was very difficult 



U«S kUilUi* nui, - 

to judge whether the sequences were authentic sijjiial 
sequences or not. As S3 and NT 1 had hydrophobic >~- 
gions far from the first putative methionine reaidui i 
and 64 residues, respectively), we did not cY 
them further although these hydrophobic 
may serve as signal sequences. The trapped 
of F7a and R6g did not locate at the 5' ends of the 
length cDNA clones and these clones were not - 
terized further. 

The following five known clones and six 
clones were concluded to contain authentic signs 1 
quences. Five clones, HI. T5, E6a, NT2, and T& 
identical to the N-terminal signal sequences of 
genes {Table 1). The trapped fragments of six 
(SI, S2, S6a, Tl, NT3, and NT5) were located at} 
ends of the full-length cDNA clones. 

Sequence Analysis of Full-Length cDNAs of the 
Identified Clones 

Full-length cDNA of the six unknown clones 
authentic signal sequences were isolated, 



re- 
72 

characterize 
sequences 
fragments 
f adl- 
chiufac- 

unkriqwn 
se- 
vere 
ofkikcwn 
clcnes 
thB5' 



with 
sequejnced, 



and characterized (Table 2). Detailed data of isolation 
and characterization of the SDF-2 clone were described 
elsewhere (Hamada et al., in press). The hydropathy 
profiles of the deduced amino acid sequences were plot- 
ted (Fig IB). All the clones have the typical N-terminal 
signal sequences and none of them have KDEL signals 
for retention in the endoplasmic reticulum (Pelham et 
al 1989) A stromal cell-derived receptor (SDR) or stro- 
mal cell-derived factor (SDF) was named depending 
on the presence or absence, respectively, of a putative 
transmembrane region. SDR1 bears a clear hydropho- 
bic region near the C-terminus and SDR2 has six hy- 
drophobic domains at the C-tenmnus. 

The sequence homology search was performed using 
GenBank and SWISS-PROT databases. SDF2 has sig- 
nificant homology to putative enzymic regions of yeast 
dolichyl phosphate-D-mannose: protein mannosyl- 
transferase, Pmtlp, and Pmt2p (Hamada et al., in 
oress) SDF3 has remarkable homology to the human 
pigment epithehum-derived factor (PEDF) (Steel* » - 
al 1992) (Fig. 2A). There were 78 and 85* identities 
at 'the nucleotide and amino acid sequence levels, re- 
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Further 
investigation 
is optional. 



> Discard 



Identical to the known 
sequences excluding N 
terminal SS 

* Discard 



spectively. This overall similarity strongly) 
that SDF3 is a mouse counterpart for the 
PEDF. SDF4 has four EF hand motifs for calc 
ing activity (Kretsinger and Nockolds, 1973).| 
shows some homology to the extracellular di 
the DrosophUa tissue polarity gene frizzled (Vif 
a/., 1989) and its rat homologues fz-1 and fz-2'\ 
aL, 1992) (Fig. 2B), but its C-terminus has no hoi&logy 
with frizzled. SDR1 has a considerable hoinolqgy to 
mouse basigin (Altruda et al., 1989) and chicken ffiT?, 
members of the immunoglobulin superfamily (Fig. 2C). 
At the amino acid sequence level there is 38<& identity 
between SDR1 and basigin, 38% identity bi^ween 
SDR1 and HT7, and 28% identity among the thrtfe pro- 
teins, while there is 45% identity between basigiii and 
HT7. SDR2 shows no significant similarity to fchown 
genes. i j 

: :■ I jj 

Expression ofmRNA of Novel Signal Sequence j ^ 
Trapped Clones 




Discard 



FIG. 4. The strategy for selection of SST porittrve cDNA fragments. 53, signal sequence. 



Br and spleen. A strong 2.1-kb transcript of the SDR1 
me was expressed ubiquitously but a 2.6-kb tran- 
sept was detected only in brain. SDR2 mRNA was 
relatively abundant in kidney. 



of 



The tissue distribution of SDF3, SDF4, SDF6, 9DR1, 
and SDR2 mRNAs was examined by Northenl blot 
analysis (Fig. 3). SDF3, SDF4, SDR1, andl &DR2 
mRNAs were expressed in almost all the organs t&sted, 
although the expression levels were variable. ;i Several 
observations are worth noting. SDF3 mRI^Ar was 
strongly expressed in liver. SDF5 mRNA was ex; rfessed 
in brain, heart, kidney, lung, and thymus, but Aot in 



DISCUSSION 



In this report, we described a large-scale screening 
two ST-2 SST libraries. Twenty-six positive clones 
Were obtained from 59 00 clones from the two libraries. 
Sequence analysis of the 26 clones revealed that 11 
dk nes contained authentic signal sequences (true), 10 
ck nes were classified as artifacts (false), and 5 clones 
were classified as undecided (undecided) (Table 1). Ar- 
tifacts described here are either technical or strategic. 
TUe former sequences do not contain signal sequences 
at all, as judged by the absence of ORFs (H2, NT6, and 
PSc)or hydrophobic regions (H4, S4, T4, T7, and NT4). 
Tne latter sequences do not contain authentic N-termi- 
,rM signal sequences but do have hydrophobic se- 
auences in the middle of the full-length cDNA clones 

t a and R5g). Undecided clones contain atypical hy- 
phobic sequences: very short hydrophobic ORFs 
(H3, T2, and T8) and very long hydrophobic sequences 
(^3 and NT1). Given the frequencies of artifacts (38%) 
arid novel genes (23%), we conclude that SST is an 
efficient method for isolating novel secreted or mem- 
le-bound proteins. 
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[ON Of SIGNAL SEQUENCE TRAP 



After positive clones are screened by &e.JSj8T 
method, further discrimination should be re^uirjbdl to 
obtain novel cDNA clones encoding authentic Njtfciri »i- 
nal signal sequences (Fig. 4). The following strategy lis 
empirically recommended. The first step is seqiiencing 
the trapped cDNA clones. It should be verified if the 
clones have a reasonably long (more than 35 residii =>s) 
ORP in frame with the Tac sequence of the vector; If 
not, the clones should be abandoned as artificts. ' he 
next step is a homology search using available data- 
bases of published genes and proteins. The cloto^s iden- 
tical to the known sequences should not be character- 
ized further. The third test is selection by the nydrcj ?a- 
thy profile. Clones that have no hydrophobic peak at 
the N-terminus should be eliminated. In thi^ way j lie 
clones that appear to contain authentic signal j Se- 
quences are selected, and their full-length cjoites j are 
isolated from a full-length cDNA library. Thejpres€f:*ce 
of an in-frame stop codon upstream of the Irst A TG 
codon is not mandatory because three unknown clb^es 
with authentic signal sequences (SDR1, S£>ir2, Kind 
SDF3) did not have upstream in-frame stop ccjdpns {see 
Table 1). | ; j 

The biggest problem in a random screening |a£pro %ch 
is to determine the functions of isolated clones. Sevisral 
strategies will facilitate the understanding otfthe ft Ac- 
tions of these molecules: patterns of tissue disfcribut on, 
overexpression in appropriate cell lines, generation of 
antibodies against the expressed proteins, Said ^ene 
targeting. Among the novel genes identified l^erei we 
gained some hints about the functions of SUF3 imd 
SDR1 from their homology with known gerie? Sj)F3 
had remarkable homology to PEDF, suggesting j;hat 
SDF3 is the mouse counterpart for human PfeDF (Fig. 
2A). PEDF is an extracellular neurotrophic frgejatj and 
a member of the serine protease inhibitor faimlj ac- 
cording to amino acid sequence similarity, but recoinbi- 
nant PEDF did not inhibit any known serine; proteges 
(Becerra et al, 1993). The tissue distribution of hu nan 
PEDF mDNA had not yet been reported except for; IPE 
cells (Tombran-Tink et al, 1994), but mouse SDF3 
mRNA was detected in all the organs tested, suggej ting 
that SDF3 may have other functions in additioii itoi aeu- 
rotrophic activity. SDR1 belongs to the immunogbbu- 
lin superfamily and has considerable hojmjology to 
mouse basigin and chicken HT7. The mot*0 baiigin 
was first identified as a membrane protein (fe^42); rom 
a mouse fibroblast expression library by screening with 
a polyclonal antiserum (Altruda et aL 9 1989). HT7 was 
identified as an antigen intensely expressed in the 
blood-brain barrier of the chicken and was) close) ^ re- 
lated to basigin according to the amino add|s£mili rity. 
As already indicated for basigin and HT7 (Miy&ukhi et 
al., 1991), the transmembrane and cytoiflatsmiiir do- 
mains were highly conserved among the three Mole- 
cules fFig. 2C). The amino acid sequences of bajsigin 
and HT7 showed 63 and 49% identity, rt&bectiyely, 
with that of SDR1. The functions of these riaoletfules 
are not yet known. The brain-specific 2.6-kli transcript 



may 

lated; 



Itle a splicing variant or a product of another re- 
jjene. 
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Abstract 

We describe a simple, rapid technique for simultaneously isolating large numbers of cDNAs encoding secreted proteins. The 
technique makes use of a facile genetic selection performed in a strain of Saccharomyces cerevisiae deleted for its endogenous 
invertase gene. A cDNA cloning vector which carries a modified invertase gene lacking its leader sequence is used in conjunction 
with this strain. Heterologous secreted genes fused appropriately upstream of this defective invertase provide the necessary signals 
to restore secretion, allowing the yeast to grow on sugars such as sucrose or raffinose. This microbial growth selection facilitates 
scanning cDNA libraries containing millions of clones, enabling the wholesale identification of novel secreted proteins without 
the need for specific bioassays. The technique is similar to one previously described (Klein et al. (1996) Proc. Natl. Acad. Sci. 
USA 93, 7108-7113). We describe results using a cDNA library derived from activated human peripheral blood mononuclear 
cells (PBMC). Genes identified from this library encoded signal sequences of proteins of diverse structure, function, and cellular 
location such as cytokines, type 1 and type 2 transmembrane proteins, and proteins found in intracellular organelles. In addition, 
a number of novel secreted proteins were identified, including a chemokine and a novel G-protein-coupled receptor. Since signal 
sequences possess features conserved throughout evolution, the procedure can be used to isolate genes encoding secreted proteins 
from both eukaryotes and prokaryotes. © 1997 Elsevier Science B.V. 

Keywords: Protein secretion; Invertase; Signal sequence; Chemokine; G-protein-coupIed receptor 



1. Introduction 

Many biological events revolve around intercellular 
signalling processes mediated by hormones or growth 
factors and their membrane-bound cellular receptors. 
Isolation of the genes encoding these proteins is of 
particular interest, not only for the purpose of under- 
standing the powerful biological processes which these 
molecules govern, but also for the therapeutic potential 
which they may hold. Because the repertoire of assays 
which can be employed for cloning purposes remains 
incomplete, however, many proteins which regulate 
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events occurring in vivo are beyond the scope of conven- 
tional cloning methods. Growth factors and their recep- 
tors, however, are all secreted proteins, which suggests 
an alternative cloning approach. 

According to the signal sequence hypothesis (Blobel 
and Dobberstein, 1975), certain peptide sequences, usu- 
ally located at the amino-terminus of nascent proteins, 
determine whether a particular protein is destined for 
secretion. Experiments confirming this hypothesis have 
shown that although these signal sequences are unique, 
they are largely interchangeable among secreted proteins 
and even between diverse organisms (Hitzeman et al., 
1990; Walter and Johnson, 1994; Rapoport et al., 1996). 
Computational analyses reveal that eukaryotic and pro- 
karyotic signal sequences are remarkably similar, provid- 
ing a rationale for these observations (von Heijne, 1985). 
In addition, biochemical experiments have demonstrated 
considerable functional interchangability between some 
components of the eukaryotic and prokaryotic protein 
secretion pathways (Bernstein et al., 1993; Hartmann 
et al., 1994). 
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The Yeast Signal Sequence Trap Vector 
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Fig. 1. The yeast signal sequence trap vector, pSUC2T7M130RI. The invertase gene in plasmid pRB58 (Kaiser et al., 1987) (base pairs 842-2386 
m GenBank accession No. V01311) was amplified using Vent polymerase (New England Biolabs, Beverly, MA, USA) and ligated to EcoRI and 
Xhol cut pJG4-l (Gyuris et al., 1993). Subsequently, the first two amino acids of mature invertase, serine and methionine, "were deleted and u 
synthetic ohgo encoding a bacteriophage T7 promoter and a hybrid human serum albumin-a factor Kex2 site (Hitzeman et al., 1990) was added 
The final modification was the incorporation of a bacteriophage Fl replication origin derived from pBJuescript (Stratagene, La Jolla CA USA) 
This vector was used in conjunction with the yeast strain YT455 (Kaiser et al., 1987) which was converted to TRP~ (Alani et al 1987) and then 
to URA (Boeke et al., 1987). 



The Saccharomyces cerevisiae enzyme invertase is an 
essential protein in certain metabolic environments. If 
sucrose or raffinose is provided as the sole source of 
carbon, yeast must secrete invertase to grow (Carlson 
et al., 1983). Because of this convenient genetic selection, 



invertase has been widely used to measure the effects of 
mutations within signal sequences (Kaiser et al., 1987; 
Perlman et al., 1986; Ngsee et al., 1989), to identify 
amino acid sequences that mediate localization to yeast 
vacuoles (Klionsky et al., 1988; Tague et al., 1990; 
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Saalbach et al., 1991) or to mitochondria (Emr et al., 
1986), and to monitor the amounts of human proteins 
expressed for commercial purposes (Hitzeman et al., 
1990). As a whole, these results demonstrated two 
important points: the genetic selection for invertase 
secretion is very sensitive, and invertase itself is quite 
tolerant of amino-terrninal extensions. We describe here 
a method which uses the invertase genetic selection to 
isolate efficiently and in high throughput, genes encoding 
secreted proteins. An effective but less efficient method 
of trapping signal sequences was first published by 
Tashiro et al. (1993). Another publication using a 
method similar to the one we describe and with similar 
results has also been published (Klein et al., 1996). 
Here we present data regarding the selection efficiency, 
cloning biases, and sequences of a novel human chemo- 
kine and a novel G-protem-coupled receptor discovered 
using the system. 

2. Results 

2. 1. Vector design and testing the invertase selection 

The vector pSUC2T7M130RI (Fig. 1) was designed 
to identify sequences in cDNA clones that mediate 
transport of proteins into the secretion pathway. The 
vector carries an invertase gene (SUC2) lacking both its 
initiating methionine codon and its signal peptide. 
Transcription of invertase is mediated by the yeast 
alcohol dehydrogenase promoter, but the invertase 
translation and secretion defect renders the plasmid 
unable to rescue the growth of a SUC2' yeast strain 
on YPR plates. SUC2~ yeast transformed with 
pSUC2T7M130RI, and placed under invertase selection 
produce no visible colonies appear even after 7 days of 
growth. When the native invertase signal sequence and 
initiator methionine are inserted into the cloning site, 
transformed SUC2~ yeast grow as well on raffinose as 
SUC2+ yeast, with a plating efficiency of 100% after 3 
days. By contrast, SUC2~ yeast carrying the vector 
modified to encode only an initiating methionine residue 
upstream of invertase (i.e., capable of producing only 
cytoplasmic invertase) produced visible colonies only 
after 8 days, and with a plating efficiency of just 2%. 
These differences in growth rates and plating efficiencies 
suggested that the growth selection would be effective 
at isolating those clones in a cDNA library containing 
natural signal sequences. 

2.2. Isolation ofcDNA clones encoding secreted proteins 

A cDNA library was constructed with mRNA isolated 
from activated human PBMC. This cell source was 
chosen as a test since it expresses a variety of known 
secreted cytokines, hormones, growth factors and cell 
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surface receptors. The library was transformed into 
SUC2~ yeast, and plasmids were isolated from colonies 
which survived the invertase selection. 

Two separate selections gave qualitatively similar 
results. In selection one, 1.9 xlO 6 transformed yeast 
(i.e., TRP~) were subjected to invertase selection, and 
a total of 147 SUC2 + clones were recovered. Of these, 
136 (92%) corresponded to database entries of well- 
characterized genes and proteins. The incidence of signal 
sequences among this set provided a measure of the 
fidelity of the invertase selection, the true positive rate. 
Among this set, 83% of the cDNAs encoded proteins 
known to contain signal sequences. A representative 
number of these were further analyzed to verify that 
they were fused in-frame to invertase, consistent with 
their signal sequences mediating the secretion of inver- 
tase. in the second run, 4.8 x 10° TRP^ yeast were 
selected, 457 SUC2^ clones were analyzed, and 424 
(93%) were informative. Among this set, 83% again 
encoded proteins known to contain signal sequences. 
Table 1 provides a combined listing of the known genes 
identified from these selections. The list includes secreted 
proteins, such as interleukins and chemokines, type 1 
and multipass transmembrane proteins, and proteins 
found in intracellular organelles. Significantly, the list 
also includes type 2 transmembrane proteins such as 
Fas ligand. In common with other type 2 membrane 
proteins (Parks, 1996) Fas ligand has no discernable 
N-terminal signal sequence. Nevertheless, the molecule 
was isolated by the yeast signal sequence trap by virtue 
of its being fused to invertase via a serine residue at 
position 153. Fas ligand must therefore possess an 
uncharacterized motif within its sequence (before residue 
153) which is capable of mediating secretion in yeast. 

A minority (17%) of isolated clones were false posi- 
tives, i.e., clones encoding proteins known not to be 
secreted. Most of these false positives encoded an initiat- 
ing methionine adventitiously followed by an amino 
acid sequence that mimicked a signal sequence. These 
mimic sequences were found to be derived either from 
3 '-untranslated regions or from coding regions, with the 
latter often employing a reading frame different from 
the one used in vivo (data not shown). The presence of 
false positives is not surprising in light of the observation 
that 20% of randomly generated peptides can function 
as signal sequences in yeast if an initiating methionine 
is provided (Kaiser et al., 1987). Perhaps more surpris- 
ing is that, despite this, we observe a greater than 80% 
specificity for signal sequences with the yeast signal 
sequence trap system. 

J.J. Extent of recovery of c UNA clones encoding signal 
sequences 

Given the known patterns of gene expression in 
PBMC, the list of genes isolated by the yeast signal 
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Secreted proteins identified in a human PBMC library- 






Accession No. 


Protein 


Accession No. 


Protein 


Extracellular: cytokines 


M97720 


T cell receptor Va 


M 13982 


IL-4 


Ml 5565 


T cell receptor Va 


X04688 


IL-5 


68696 


T cell receptor Va 


X 1 7543 


IL-9 


07294 


T cell receptor Vfi 


XI 3274 


V-IFN 


3 7797 


RP105 ( homolog) 


X 10394 


Tumor necrosis factor 






XOI393 


Lymphotoxin 


Cell surface: type 11 transmembrane proteins 


U32659 


IL-17 (homolog) 


U11276 


NKR-Pla protein 






1:08137 


Fas ligand 


Extracellular: chemokines 


X67878 


CD40 ligand 


Z11686 


IL-8 


Ml 4766 


Fc epsilon RII (CD23) 


J04130 


Act-2 


K01 144 


MHC class II invariant chain (CD74) 


X72755 


HuMig 


L07555 


Activation inducer molecule (CD69) 


X64885 


R ANTES 


HI 1808 


Activation inducer molecule (CD69) (homolog 


X 14768 


MCP-1 






X71087 


MCP-3 


Cell surface: multi-pass transmembrane proteins 


M23178 


MlP-la 


X71635 


Neuropeptide Y-like receptor 


M24110 


MIP-I2 homolog-2 


X07982 


LIMP (CD63) 


X72755 


HuMig (homolog) 


H33447 


Putative G-coupled protein receptor (homolog 






Intracellular: o 


rganelle proteins 


Cell surface: type I transmembrane proteins 


U35237 


Tryptase 2 


Ml 43 62 


LFA-2 (CD2) 


M90696 


Cathepsin S (CTSS) 


M23461 


Leukocyte Common Antigen (CD45) 


X74lf>4 


Translocon associated protein fi 


M33195 


Fc RI (CD64) 


J04071 


Serine esterase 


LI 5006 


CTLA-4 


H65908 


Oligopeptide transporter (homolog) 


Z22968 


Ml 30 Antigen 






U03397 


4- IBB Receptor 


Miscellaneous: 




D11086 


IL-2 Receptor gamma 


X 14584 


Immunoglobulin /i-chain 


S93788 


Ocular melanoma associated antigen 


X57809 


Immunoglobulin /1-chain 


S82300 


j$2 microglobulin 


Ml 2759 


Ig J chain 


U09087 


Thymopoietin fi 


XI 7042 


Hematopoietic proteoglycan core protein 


X 54890 


Leukocyte common antigen related peptide 


J04621 


Heparan sulfate proteoglycan core protein 


X0O457 


HLA-SB oc-chain 


D 14043 


MGC-24 (peanut agglutinin binding protein) 


JOG 194 


HLA-DR 2-chain 


XI 3694 


Osteopontin 


M60334 


HLA-DR 2-chain 


A 101 56 


Lysozyme 


KOI 172 


HLA-DS a-chain 


X02530 


V-IFN inducible early response gene 


M81140 


HLA-DQ ^-chain 


X67698 


Tissue specific secretory protein (HE1) 


X58767 


T cell receptor Va 


D28593 


Mannose binding protein associated serine 


X58746 


T cell receptor Wot 




protease (homolog) 


X58738 


T cell receptor Va 


U01235 


Pre-pro hemorrhagic toxin ( homolog) 


M87477 


T cell receptor Vac 


R 16862 


OTS-8 (homolog) 



Proteins of known cellular location isolated by the yeast signal sequence trap from a human PBMC cDNA library. Total RNA was isolated 
(Chomczynski and Sacchi, 1987) from human PBMC activated by culture in ConA and PMA. PolyA* mRNA, isolated with the Poly ATtr act 
mRNA isolation system (Promega, Madison, WI, USA), was converted to double-stranded cDNA using the Superscript Choice System (Gibco 
BRL, Gaithersburg, MD, USA) and the oligo/random primer 5 -TCCCGATTGAATTCTAGACCTGCGTCGACNNNNNN-3'. EcoKl adaptors 
were ligated onto the cDNA which was then digested with Sail: 300 600 base pair fragments were ligated into pSUC2T7M 1 30RI pre-digested 
with EcoK\ and Xhol. Electrocompetent DH10B cells (Gibco BRL) were transformed, and 5 x 10 6 colonies were pooled to prepare the amplified 
plasmid cDNA library (Sambrook et al., 1989). Yeasts transformed using lithium acetate were selected by growth on complete minimal plates 
lacking tryptophan (CMD-W) (Ausubel et al., 1995) for 3 days before being replicated onto YPR plates (1% yeast extract. 2% peptone, 2 c/ 0 
rafrinose, 2 ug/ml Antimycin A (cat no. A-8674; Sigma, St. Louis, MO, USA)) to select for plasmids encoding signal sequences. Seven days late r 
yeast colonies were purified by streaking onto these same plates. Individual colonies were patched onto CMD-W plates, grown overnight, and 
then inoculated into deep-well trays to prepare plasmid DNA essentially as described (Blanchard and Nowotny, 1994). The cDNA inserts were 
amplified with Taq polymerase (Perkin-Elmer. Norwalk, CT, USA) and sequenced utilizing the chain-termination method (Sanger et al., 1977). 
DNA sequences were compared with sequences in GenBank and GenPept utilizing the programs BlastN and BlastX (Altschul et al., 1990) and 
FastA (Pearson and Lipman, 1988) 



sequence trap shown in Table 1 is certainly incomplete. 
The two selections were not intended to be an exhaustive 
search for all genes encoding signal sequences expressed 
by PBMC; nevertheless there were some expected genes 



that were not recovered (i.e., false negatives). Possible 
reasons for false negatives include; the complete absence 
of a cDNA from the library, an insufficient sample size 
for the yeast selections, or the existence of poorly 
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expressed, poorly secreted, or inactive invertase fusions 
for certain genes. To address some of these questions, 
we measured the frequencies of 1 1 genes expected to be 
present in the starting library by hybridization to 50 000 
E. coli colonies. y-IFN and IL-8 were very abundant 
both in the starting library, at frequencies of 1:300 and 
1 :2000, respectively, and in the clones recovered 
following the selections, frequencies of 1:1.6 and 1:20, 
respectively. Although clones encoding GM-CSF, IL-2, 
IL-6, and osteopontin were present in moderate abun- 
dance in the starting library, 1:10 000-1:20 000, only 
osteopontin was recovered by the yeast signal sequence 
trap in these two selections. However, we have isolated 
IL-2 and IL-6 in subsequent selections. Of five genes 
present at low frequency in the starting library less than 
or equal to 1:50 000, three (IL-4, IL-5, and TNF) were 
isolated in these two selections and the two remaining 
(IL-3 and IL-10) in subsequent selections. This accounts 
for 10 of the 11 genes. Thus, the yeast signal sequence 
trap does isolate rarely expressed genes, but the particu- 
lar genes isolated are partly a function of sample size. 

We have not yet isolated a clone of GM-CSF, indicat- 
ing that the system can exhibit cloning bias (see 
Section 3). One possibility, that certain structural classes 
of protein may be impossible to isolate using the system, 
is unlikely given the wide structural diversity of proteins 
whose genes have been selected. Another is that particu- 
lar cDNA fusions of specific proteins could deleteriously 
affect invertase either through steric hindrance or by 
causing folding defects. Proteins are so diverse that it is 
difficult to contrive a universal solution to this problem, 
but as a safeguard we have incorporated a Kex2 proteo- 
lytic processing site at all fusion junctions between 
cDNAs and invertase. Kex2 is an endogenous yeast 
protease that cleaves proteins with appropriately pre- 
sented pairs of basic amino acids as they pass through 
the Golgi (Hitzeman et al., 1990; Redding et al., 1991 ). 
Cleavage of invertase fusion proteins by Kex2 during 
secretion should not only reduce the possibility of enzy- 
matically inactive fusions, but should also reduce and 
normalize any subtle effects of particular fusions on 
invertase specific activity. 

Because so many clones of y-IFN were recovered in 
the two selections, we had an opportunity to examine 
many independent y-IFN cloning events to investigate 
any potential bias of the system in the selection of 
particular y-IFN/ invertase fusion junctions. Data pre- 
sented in Fig. 2 revealed the yeast signal sequence trap 
to be very permissive in the position of these fusion 
junctions, with fusion points dispersed uniformly 
throughout the y-IFN sequence. The same analysis 
performed on a number of IL-8 clones gave similar 
results, suggesting that any biases introduced by the 
yeast selection are more likely caused by cDNA expres- 
sion issues rather than by the character of the protein 
fragments fused to invertase. 
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2. 4. Description of novel homologs 

Novel homologs of several genes were isolated from 
the two selections performed on the PBMC-derived 
library. The full-length protein sequences for two of 
these novel molecules are presented in Fig. 3. Clone 
HI 74 (Fig. 3a) is homologous to the a-chemokine 
HuMig (Liao ct al., 1995). The sequence of HI 74 
contains both a good predicted signal sequence and the 
C-X-C motif characteristic of a-chemokines. It is iden- 
tical to a gene of unknown function designated /f-Rl, 
which is selectively induced by /MFN (Rani et al., 1996). 

Clone H963 (Fig. 3b) is a novel human seven trans- 
membrane receptor. The figure shows an alignment of 
H963 to the platelet activating receptor (Ye et al., 1991), 
which is its closest human homolog with defined func- 
tion. The predicted fuii-iength open reading frame tor 
H963 contains the seven predicted hydrophobic trans- 
membrane segments characteristic of this receptor class. 
Interestingly, although the original fusion to invertase 
formed after residue 129 of the H963 sequence, the 
5'-end of the fused cDNA fragment in the original 
construct did not include the authentic initiating methio- 
nine codon. Instead, the yeast utilized the methionine 
at residue 98 to initiate translation of the fusion, and 
secretion was mediated by sequences in the third trans- 
membrane segment acting as a signal peptide. There is 
a predicted signal sequence cleavage site immediately 
after this third transmembrane segment following resi- 
due 115 (SigCleave score 3.3). H963 is not unique in 
being isolated using a transmembrane segment; we have 
isolated several other receptor molecules via this route. 



3. Discussion 

The yeast signal sequence trap cloning approach is 
rapid, simple, accurate, and identifies cDNAs encoding 
signals that mediate the transport and secretion of 
proteins diverse in structure, function, and cellular loca- 
tion. We have isolated clones encoding secreted proteins, 
type 1 transmembrane proteins, type 2 transmembrane 
proteins, multipass transmembrane proteins, and pro- 
teins located in intracellular organelles, A small percen- 
tage of the isolates are false positives, arising from 
clones adventitiously encoding peptides that mimic 
signal sequences. 

Several papers describing effective and innovative but 
less efficient methods of trapping signal sequences have 
been published. The COS-based systems (Tashiro et al., 
1993; Yokoyama-Kobayashi et al., 1995) are limited to 
pools of 200 clones or less pei liansfeciion, whiie the 
ES-based system (Skarnes et al., 1995) is limited by the 
necessity of generating and screening mammalian cell 
lines. Each of these methods is a screen. Our system, 
which is similar in many respects to the one reported 
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151 KRKRSQHLFQ ORRASQ 

Fig. 2. Many points of fusion of y-IFN to invertase are compatible with the genetic selection. The amino acid sequence of y-IFN is marked with 
arrows to denote the positions fused to invertase. Forty-three y-IFN clones recovered following invertase selection were sequenced, and 20 
independent fusion sites were identified. 



by Klein et al. (1996), is a selection, and consequently 
more sensitive and by design less labor intensive. We 
select for signal sequences among millions of trans- 
formed yeast, and positives are isolated simply by streak- 
ing colonies onto selective medium. Moreover, the yeast 
system identifies a more diverse repertoire of secreted 
sequences, as adduced by the isolation of genes encoding 
type 2 transmembrane proteins. 

False negatives, i.e., genes encoding secreted proteins 
which are not identified, are a limitation of any signal- 
sequence cloning scheme, but may be minimal with the 
yeast system. The wide structural diversity of the known 
proteins identified using the yeast signal-sequence trap 
argues strongly against any broad class of proteins being 
totally unrepresented. Moreover, invertase activity is 
tolerant of many different amino terminal extensions. 
Kaiser et al. (1987) selected for functioning signal 
sequences derived from randomly sheared human geno- 
mic DNA and estimated that 20% of all open reading 
frames functioned well enough as signal sequences to 
mediate secretion of invertase, as measured by the 
genetic selection (Kaiser et al., 1987). In our experiments 
we also have observed that many diverse points of 
fusion between y-IFN and invertase are readily selected 
(Fig. 2). Even if a particular fusion were to deleteriously 
effect invertase activity, it is likely that alternative fusions 
of the same cDNA to invertase would be fully active. 
The consequence may be a reduced frequency for that 
cDNA in the yeast selections, but it should still be 
recoverable. 

False negatives may also be caused by a failure of the 



yeast translational apparatus to initiate on heterologous 
mRNA sequences. However unlike E. coli (McCarthy 
and Gualerzi, 1990) and mammalian ribosomes (Kozak, 
1989), yeast ribosomes do not require specific nucleotide 
motifs to initiate translation. The only identified criteria 
for translation initiation in yeast is a preference, but not 
an absolute requirement, for an AU-rich 5 '-untranslated 
region (Yoon and Donahue, 1992). We suggest, there- 
fore, that the efficiencies of heterologous translation 
initiation and secretion in yeast are not significant 
impediments to identifying clones. 

The class of genes selected by the yeast signal sequence 
trap is enriched proteins involved in cell growth, differ- 
entiation, and intercellular communication, important 
cellular processes which are of high scientific and com- 
mercial interest. Since genes encoding secreted proteins 
represent only a fraction, perhaps 10%, of the total 
genome, and are encoded by rarely expressed genes, the 
yeast signal sequence trap is more efficient at identifying 
them than random sequencing. Additionally, since the 
yeast signal sequence trap isolates the 5 '-end of mRNAs, 
it provides an important advantage in isolating full- 
length cDNAs, which can then be biologically tested. 
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Fig. 3. Sequence alignment of novel genes identified by the Signal Sequence Trap to close homologs. Alignments were determined by FastA searches 
(Pearson and Lipman, 1988) against GenBank. Regions of identity are indicated by colons (:), conservative amino acid changes by single dots ( ). 
Closed arrowheads indicate the signal sequence cleavage sites of the novel clones as determined by SigCleave (Program Manual for the Wisconsin 
Package, 1994); open arrowheads show the fusion junctions to invertase. (a) Comparison of the amino acid sequence of novel clone HI 74 to 
HuMig, an oc-chemokine. Solid bars highlight the cysteine residues, (b) Comparison of the amino acid sequence of novel clone H963 to human 
platelet activating factor receptor, a seven transmembrane receptor family member. Shaded bars delineate predicted transmembrane segments. The 
black dot shows the methionine used for translation initiation in the yeast selection; the asterix denotes the 5-end of the original partial cDNA 
clone. The cDNA sequences arc available in GenBank under accession Nos. AF002985 for HI 74 and AF002986 for H963. 
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PCT/US98/22376 



The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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This application is a continuation-in-part of application Ser. No. 60/XXX,XXX 
(converted to a provisional application from non-provisional application Ser. No. 
08/815,047), filed March 14, 1997. 

2 0 FIELD OF THE INVENTION 

The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

25 BACKGROUND OF THE INVENTION 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 
clone novel polynucleotides "directly" in the sense that they rely on information directly 

3 0 related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein 

in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 
sequence motif, as well as various PCR-based or low stringency hybridization cloning 
3 5 techniques, have advanced the state of the art by making available large numbers of 
DNA/amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the 
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
polynucleotides encoding them that the present invention is directed. 

40 
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SUMMARY OF THE INVENTION 
In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

5 NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 533 to nucleotide 673; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 596 to nucleotide 673; 

10 (d) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l from nucleotide 1 to nucleotide 664; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bd379_l deposited under accession 
number ATCC 98361; 

15 (f) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone bd379_l deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bd379_l deposited under accession number 
ATCC 98361; 

2 0 (h) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone bd379_l deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(j) a polynucleotide encoding a protein comprising a fragment of the 

2 5 amino acid sequence of SEQ ID NO:2 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

3 0 (m) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:l from nucleotide 533 to nucleotide 673; the nucleotide sequence of SEQ ID NO:l from 
nucleotide 596 to nucleotide 673; the nucleotide sequence of SEQ ID NO:l from 

2 
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nucleotide 1 to nucleofo^664; the nucleotide sequence of the J^^ength protein coding 
sequence of clone bd379_l deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone bd379_l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
5 polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone bd379_l deposited under accession number ATCC 98361. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:2 from amino acid 1 to amino acid 44. 
Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
10 IDNO:l. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

15 (b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 

amino acid 44; 

(c) fragments of the amino acid sequence of SEQ ID NO:2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bd379_l deposited under accession number ATCC 98361; 

2 0 the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence 
of SEQ ID NO:2 from amino acid 1 to amino acid 44. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
25 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 55 to nucleotide 1008; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:3 from nucleotide 952 to nucleotide 1008; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 403 to nucleotide 981; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bpl21_2 deposited under accession 
number ATCC 98361; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone bpl21_2 deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bpl21_2 deposited under accession number 
ATCC 98361; 

(h) a polynucleotide encoding the mature protein encoded by the 
1 0 cDNA insert of clone bpl21_2 deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity; 
15 (k) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
2 0 to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 55 to nucleotide 1008; the nucleotide sequence of SEQ ID NO:3 from 
nucleotide 952 to nucleotide 1008; the nucleotide sequence of SEQ ID NO:3 from 
nucleotide 403 to nucleotide 981; the nucleotide sequence of the full-length protein coding 

2 5 sequence of clone bpl21_2 deposited under accession number ATCC 98361; or the 

nucleotide sequence of the mature protein coding sequence of clone bpl21_2 deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone bpl21_2 deposited under accession number ATCC 98361. In yet other preferred 

3 0 embodiments, the present invention provides a polynucleotide encoding a protein 

comprising the amino acid sequence of SEQ ID NO:4 from amino acid 119 to amino acid 
309. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:3. 

4 
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In other eml^^rments, the present invention provid^^^omposition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

5 (b) the amino acid sequence of SEQ ID NO:4 from amino acid 119 to 

amino acid 309; 

(c) fragments of the amino acid sequence of SEQ ID NO:4; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bpl21_2 deposited under accession number ATCC 98361; 

10 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:4 or the amino acid sequence 
of SEQ ID NO:4 from amino acid 119 to amino acid 309. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
15 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 52 to nucleotide 639; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
2 0 NO:5 from nucleotide 1 to nucleotide 308; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bp646_10 deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 

2 5 cDNA insert of clone bp646_10 deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bp646_10 deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 

3 0 cDNA insert of clone bp646_10 deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity; 
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(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

5 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 52 to nucleotide 639; the nucleotide sequence of SEQ ID NO:5 from 
nucleotide 1 to nucleotide 308; the nucleotide sequence of the full-length protein coding 

10 sequence of clone bp646_10 deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone bp646_10 deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone bp646_10 deposited under accession number ATCC 98361. In yet other preferred 

15 embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:6 from amino acid 1 to amino acid 86. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:5. 

In other embodiments, the present invention provides a composition comprising 
2 0 a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) the amino acid sequence of SEQ ID NO:6 from amino acid 1 to 
amino acid 86; 

2 5 (c) fragments of the amino acid sequence of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bp646„10 deposited under accession number ATCC 98361; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:6 or the amino acid sequence 

3 0 of SEQ ID NO:6 from amino acid 1 to amino acid 86. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 
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(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 134 to nucleotide 1183; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 191 to nucleotide 1183; 

5 (d) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7 from nucleotide 1 to nucleotide 763; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cf50„l deposited under accession number 
ATCC 98361; 

10 (f) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone cf50_l deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone rf 50_1 deposited under accession number ATCC 
98361; 

15 (h) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone cf50_l deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
2 0 amino acid sequence of SEQ ID NO:8 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

2 5 (m) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 134 to nucleotide 1183; the nucleotide sequence of SEQ ID NO:7 
from nucleotide 191 to nucleotide 1183; the nucleotide sequence of SEQ ID NO:7 from 

3 0 nucleotide 1 to nucleotide 763; the nucleotide sequence of the full-length protein coding 

sequence of clone cf50_l deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone cf50_l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
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clone cf50__l deposited under accession number ATCC 98361. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:8 from amino acid 1 to amino acid 210. 
Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
5 ID NO:7. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

10 (b) the amino acid sequence of SEQ ID NO:8 from amino acid 1 to 

amino acid 210; 

(c) fragments of the amino acid sequence of SEQ ID NO:8; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
cf50_l deposited under accession number ATCC 98361; 

1 5 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:8 or the amino acid sequence 
of SEQ ID NO:8 from amino acid 1 to amino acid 210. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
2 0 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 740 to nucleotide 2245; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

2 5 NO:9 from nucleotide 1 to nucleotide 463; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cwl543_3 deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 

3 0 cDNA insert of clone cwl543_3 deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone cwl543_3 deposited under accession number 
ATCC 98361; 
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(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone cwl543_3 deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

5 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO: 10 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
10 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 740 to nucleotide 2245; the nucleotide sequence of SEQ ID NO:9 

15 from nucleotide 1 to nucleotide 463; the nucleotide sequence of the full-length protein 
coding sequence of clone cwl543_3 deposited under accession number ATCC 98361; or 
the nucleotide sequence of the mature protein coding sequence of clone cwl543_3 
deposited under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

2 0 clone cwl543_3 deposited under accession number ATCC 98361. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:9. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 

2 5 consisting of: 

(a) the amino acid sequence of SEQ ID NO:10; 

(b) fragments of the amino acid sequence of SEQ ID NO:10; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
cwl543_3 deposited under accession number ATCC 98361; 

3 0 the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino acid sequence of SEQ ID NO:10. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
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(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:ll; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 11 from nucleotide 952 to nucleotide 1074; 

5 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:ll from nucleotide 524 to nucleotide 1059; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone da389_l deposited under accession 
number ATCC 98361; 

10 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone da389_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone da389_l deposited under accession number 
ATCC 98361; 

15 (g) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone da389_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 12; 

(i) a polynucleotide encoding a protein comprising a fragment of the 

2 0 amino acid sequence of SEQ ID NO: 12 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

25 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 952 to nucleotide 1074; the nucleotide sequence of SEQ ID NO:ll 
from nucleotide 524 to nucleotide 1059; the nucleotide sequence of the full-length protein 

3 0 coding sequence of clone da389_l deposited under accession number ATCC 98361; or the 

nucleotide sequence of the mature protein coding sequence of clone da389__l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone da389_l deposited under accession number ATCC 98361. In yet other preferred 
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embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO: 12 from amino acid 1 to amino acid 
36. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
5 IDNO:ll. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

10 (b) the amino acid sequence of SEQ ID NO:12 from amino acid 1 to 

amino acid 36; 

(c) fragments of the amino acid sequence of SEQ ID NO:12; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
da389_l deposited under accession number ATCC 98361; 

15 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:12 or the amino acid sequence 
of SEQ ID NO:12 from amino acid 1 to amino acid 36. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

2 0 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 13 from nucleotide 78 to nucleotide 1619; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
25 NO: 13 from nucleotide 604 to nucleotide 1307; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dd71_2 deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 

3 0 cDNA insert of clone dd71_2 deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dd71_2 deposited under accession number 
ATCC 98361; 
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(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone dd71_2 deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 14; 

5 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO: 14 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
10 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 78 to nucleotide 1619; the nucleotide sequence of SEQ ID NO:13 

1 5 from nucleotide 604 to nucleotide 1307; the nucleotide sequence of the full-length protein 
coding sequence of clone dd71_2 deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone dd71_2 deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

2 0 clone dd71_2 deposited under accession number ATCC 98361. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:14 from amino acid 200 to amino acid 
410. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 

2 5 IDNO:13. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

3 0 (b) the amino acid sequence of SEQ ID NO:14 from amino acid 200 to 

amino acid 410; 

(c) fragments of the amino acid sequence of SEQ ID NO: 14; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
dd71_2 deposited under accession number ATCC 98361; 
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the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:14 or the amino acid sequence 
of SEQ ID NO:14 from amino acid 200 to amino acid 410. 

In one embodiment, the present invention provides a composition comprising an 
5 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1003 to nucleotide 1350; 

10 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15 from nucleotide 714 to nucleotide 1320; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
leneth protein coding sequence of clone dm221_l deposited under accession 
number ATCC 98361; 

15 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone dm221_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dm221_l deposited under accession number 
ATCC 98361; 

2 0 (g) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone dm221_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 16; 

(i) a polynucleotide encoding a protein comprising a fragment of the 

2 5 amino acid sequence of SEQ ID NO:16 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

3 0 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1003 to nucleotide 1350; the nucleotide sequence of SEQ ID NO:15 
from nucleotide 714 to nucleotide 1320; the nucleotide sequence of the full-length protein 
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coding sequence of clone dm221_l deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone dm221_l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
5 clone dm221_l deposited under accession number ATCC 98361. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO: 16 from amino acid 1 to amino acid 
106. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
10 IDNO:15. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 16; 

15 (b) the amino acid sequence of SEQ ID NO:16 from amino acid 1 to 

amino acid 106; 

(c) fragments of the amino acid sequence of SEQ ID NO:16; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
dm221_l deposited under accession number ATCC 98361; 

2 0 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:16 or the amino acid sequence 
of SEQ ID NO: 16 from amino acid 1 to amino acid 106. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

2 5 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 17 from nucleotide 449 to nucleotide 1006; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO: 17 from nucleotide 1 to nucleotide 331; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dx279_l deposited under accession 
number ATCC 98361; 
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(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone dx279_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dx279_l deposited under accession number 

5 ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone dx279_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:18; 

10 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO: 18 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
15 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 449 to nucleotide 1006; the nucleotide sequence of SEQ ID NO:17 
2 0 from nucleotide 1 to nucleotide 331; the nucleotide sequence of the full-length protein 
coding sequence of clone dx279_l deposited under accession number ATCC 98361; or the 
nucleotide sequence of the mature protein coding sequence of clone dx279_l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

2 5 clone dx279_l deposited under accession number ATCC 98361. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:17. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 

3 0 consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) fragments of the amino acid sequence of SEQ ID NO:18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
dx279 1 deposited under accession number ATCC 98361; 
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the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:18. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
5 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 19 from nucleotide 74 to nucleotide 865; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
1 0 NO:19 from nucleotide 538 to nucleotide 1044; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gm243_l deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
15 cDNA insert of clone gm243_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone gm243„l deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 

2 0 cDNA insert of clone gm243__l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity; 

25 (j) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 

3 0 to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO: 19 from nucleotide 74 to nucleotide 865; the nucleotide sequence of SEQ ID NO: 19 
from nucleotide 538 to nucleotide 1044; the nucleotide sequence of the full-length protein 
coding sequence of clone gm243_l deposited under accession number ATCC 98361; or the 
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nucleotide sequence of the mature protein coding sequence of clone gm243_l deposited 
under accession number ATCC 98361. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone gm243_l deposited under accession number ATCC 98361. 
5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

ID NO:19. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

1 0 (a) the amino acid sequence of SEQ ID NO:20; 

(b) fragments of the amino acid sequence of SEQ ID NO:20; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
gm243 1 deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. Preferably such 
15 protein comprises the amino acid sequence of SEQ ID NO:20. 

In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, including bacterial, 
yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 
Processes are also provided for producing a protein, which comprise: 
2 0 (a) growing a culture of the host cell transformed with such 

polynucleotide compositions in a suitable culture medium; and 
(b) purifying the protein from the culture. 
The protein produced according to such methods is also provided by the present 
invention. Preferred embodiments include those in which the protein produced by such 

2 5 process is a mature form of the protein. 

Protein compositions of the present invention may further comprise a 
pharmaceutically acceptable carrier. Compositions comprising an antibody which 
specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 

3 0 condition which comprises administering to a mammalian subject a therapeutically 

effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Fig, 1 is a schematic representation of the pED6 and pNOTs vectors used for 
deposit of clones disclosed herein. 

DETAILED DESCRIPTION 
ISOLATED PROTEINS AND POLYNUCLEOTIDES 

Nucleotide and amino acid sequences, as presently determined, are reported 
below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 
in accordance with known methods. The predicted amino acid sequence (both full-length 
and mature) can then be determined from such nucleotide sequence. The amino acid 
sequence of the protein encoded by a particular clone can also be determined by 
expression of the clone in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants have identified what they have 
determined to be the reading frame best identifiable with sequence information available 
at the time of filing. 

As used herein a "secreted" protein is one which, when expressed in a suitable host 
cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

Clone "bd379 1" 

A polynucleotide of the present invention has been identified as clone "bd379_l'\ 
bd379_l was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. bd379_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"bd379_l protein"). 

The nucleotide sequence of bd379_l as presently determined is reported in SEQ 
ID NO:l. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the bd379_l protein corresponding to the foregoing 
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nucleotide sequence is reported in SEQ ID NO:2. Amino acids 9 to 21 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 22, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
5 bd379_l should be approximately 1200 bp. 

The nucleotide sequence disclosed herein for bd379_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. bd379_l demonstrated at least some similarity with sequences 
identified as F05256 (H. sapiens partial cDNA sequence; clone c-05b06), R60369 
10 (yh04b03.rl Homo sapiens cDNA clone 42053 5'), and W39550 (zcl8g02.rl Soares 
parathyroid tumor NbHPA Homo sapiens cDNA clone 322706 5 ). Based upon sequence 
similarity, bd379_l proteins and each similar protein or peptide may share at least some 
activity. 



15 Clone"bpl21 2" 

A polynucleotide of the present invention has been identified as clone M bpl21_2". 
bpl21_2 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

2 0 analysis of the amino acid sequence of the encoded protein. bpl21_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"bpl21„2 protein"). 

The nucleotide sequence of bpl21„2 as presently determined is reported in SEQ 
ID NO:3. What applicants presently believe to be the proper reading frame and the 

2 5 predicted amino acid sequence of the bpl21_2 protein corresponding to the foregoing 

nucleotide sequence is reported in SEQ ID NO:4. Amino acids 287 to 299 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 300, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

3 0 bpl21_2 should be approximately 4175 bp. 

The nucleotide sequence disclosed herein for bpl21_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. bpl21_2 demonstrated at least some similarity with sequences 
identified as AA261860 (zs!8gl2.sl NCI CGAP_GCB1 Homo sapiens cDNA clone 



BNSDOC'D * WO 9840486A2 I > 



WO 98/40486 PCT/US98/04977 




IMAGE:685606 3'), AA478628 (zvl9g09.sl Soares NhHMPu SI Homo sapiens cDNA clone 
754144 3* similar to WP D1022.1 CE02575 UBIQUITIN-CONJUGATING ENZYME), 
H43996 (yo70hl0.rl Homo sapiens cDNA clone 183331 5'), N20622 (yx46f08.rl Homo 
sapiens cDNA clone 264807 5'), N34063 (yx78a05.rl Homo sapiens cDNA clone 267824 5' 
5 similar to D82419 similar to none), N57554 (yy81e07.sl Homo sapiens cDNA clone 279972 
3*), U23517 (Caenorhabditis elegans cosmid D1022), W19342 (zb90c09.sl Soares senescent 
fibroblasts NbHSF Homo sapiens cDNA clone 310864 3' similar to WP D1022.1 CE02575 
UBIQUITIN-CONJUGATING ENZYME), and W81357 (zd86c08.sl Soares fetal heart 
NbHH19W Homo sapiens cDNA clone 347534 3 f similar to WP D1022.1 CE02575 

1 0 UBIQUITIN-CONJUGATING ENZYME). The predicted amino acid sequence disclosed 
herein for bpl21_2 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted bpl21_2 protein 
demonstrated at least some similarity to sequences identified as U23517 (similar to 
ubiquitin conjugating enzyme [Caenorhabditis elegans]) and W05315 (Ubiquitin 

15 conjugating enzyme 9). Based upon sequence similarity, bp!21_2 proteins and each 
similar protein or peptide may share at least some activity. The TopPredll computer 
program predicts a potential transmembrane domain within the bp!21_2 protein sequence 
centered around amino acid 110 of SEQ ID NO:4. 

2 0 Clone "bp646 10" 

A polynucleotide of the present invention has been identified as clone "bp646_10". 
bp646_10 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

2 5 analysis of the amino acid sequence of the encoded protein. bp646_10 is a full-length 

clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "bp646_10 protein"). 

The nucleotide sequence of bp646_10 as presently determined is reported in SEQ 
ID NO:5. What applicants presently believe to be the proper reading frame and the 

3 0 predicted amino acid sequence of the bp646_10 protein corresponding to the foregoing 

nucleotide sequence is reported in SEQ ID NO:6. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
bp646_10 should be approximately 1800 bp. 
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The nucleotide sequence disclosed herein for bp646„10 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. bp646_10 demonstrated at least some similarity with sequences 
identified as AA040456 (zk46fl0.rl Soares pregnant uterus NbHPU Homo sapiens cDNA 
5 clone 485899 5'), AA101294 (zn71f03.rl Stratagene NT2 neuronal precursor 937230 Homo 
sapiens cDNA clone 563645 5' similar to WP K07E3.6 CE04722 TRANSLOCATING 
ATPASE), A Al 79341 (zp48f01.sl Stratagene HeLa cell s3 937216 Homo sapiens cDNA 
clone 612697 3'), N54113 (yz02e02.rl Homo sapiens cDNA clone 281882 5'), T21123 
(Human gene signature HUMGS02428), U63315 (Rattus norvegicus 25-Dx (25Dx) mRNA, 

10 complete cds), X99714 (S.scrofa mRNA for steroid membrane binding protein), and 
Y12711 (H.sapiens mRNA for putative progesterone binding). The predicted amino acid 
sequence disclosed herein for bp646_10 was searched against the GenPept and GeneSeq 
amino acid sequence databases using the BLASTX search protocol. The predicted 
bp646_10 protein demonstrated at least some similarity to sequences identified as U63315 

1 5 (25-Dx [Rattus norvegicus]), X99714 (steroid membrane binding protein [Sus scrofa]), amd 
Y12711 (putative progesterone binding protein). Based upon sequence similarity, 
bp646_10 proteins and each similar protein or peptide may share at least some activity. 
The TopPredll computer program predicts a potential transmembrane domain within the 
bp646_10 protein sequence centered around amino acid 40 of SEQ ID NO:6. 

20 

Clone "cf50 1" 

A polynucleotide of the present invention has been identified as clone "cf50_l". 
cf50_l was isolated from a human adult placenta cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 

2 5 identified as encoding a secreted or transmembrane protein on the basis of computer 

analysis of the amino acid sequence of the encoded protein. cf50_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"cf50_l protein"). 

The nucleotide sequence of cf50_l as presently determined is reported in SEQ ID 

3 0 NO:7. What applicants presently believe to be the proper reading frame and the predicted 

amino acid sequence of the cf50_l protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO:8. Amino acids 7 to 19 are a predicted leader/signal 
sequence, with the predicted mature amino acid sequence beginning at amino acid 20, or 
are a transmembrane domain. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
cf50_l should be approximately 1500 bp. 

The nucleotide sequence disclosed herein for cf50_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. cf50_l demonstrated at least some similarity with sequences 
identified as H15004 (yl26c09.sl Homo sapiens cDNA clone 159376 3'), H52859 (EST0013 
Homo sapiens cDNA clone HTN-6-19), and R86003 (ypl2c03.rl Homo sapiens cDNA 
clone 187204 5'). Based upon sequence similarity, cf50_l proteins and each similar protein 
or peptide may share at least some activity. 



Clone "cwl 543 3" 

A polynucleotide of the present invention has been identified as clone "cwl543_3 M . 
cwl543_3 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
15 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. cwl543_3 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as M cwl543_3 protein"). 

The nucleotide sequence of cwl543_3 as presently determined is reported in SEQ 
2 0 ID NO:9. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the cwl543_3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:10. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
cwl543_3 should be approximately 3300 bp. 

2 5 The nucleotide sequence disclosed herein for cwl543_3 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. cwl543_3 demonstrated at least some similarity with sequences 
identified as AA021431 (ze68f09.sl Soares retina N2b4HR Homo sapiens cDNA clone 
364169 3' similar to PIR:A55626 A55626 monocarboxylate transporter MCT2 - golden 

3 0 hamster), R68272 (yi06c07.sl Homo sapiens cDNA clone 138444 3'), and U79304 (Human 

clone 23909 mRNA, partial cds). The predicted amino acid sequence disclosed herein for 
cwl543_3 was searched against the GenPept and GeneSeq amino acid sequence databases 
using the BLASTX search protocol. The predicted cwl543_3 protein demonstrated at least 
some similarity to sequences identified as U62316 (monocarboxylate transporter 2 [Rattus 
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norvegicus]), U79304 (unknown [Homo sapiens]), and AF000240 (monocarboxylate 
transporter 3 [Gallus gallus]). Based upon sequence similarity, cwl543_3 proteins and 
each similar protein or peptide may share at least some activity. 



5 Clone "da389 1" 

A polynucleotide of the present invention has been identified as clone "da389_l". 
da389_l was isolated from a human adult placenta cDNA library using methods which 
are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
1 0 analysis of the amino acid sequence of the encoded protein. da389_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
M da389_l protein"). 

The nucleotide sequence of da389 1 as presently determinpd is reported in SEQ 
ID NO:ll. What applicants presently believe to be the proper reading frame and the 

15 predicted amino acid sequence of the da389_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:12. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
da389_l should be approximately 2000 bp. 

The nucleotide sequence disclosed herein for da389_l was searched against the 

2 0 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. da389_l demonstrated at least some similarity with sequences 
identified as R46114 (yg49g06.sl Homo sapiens cDNA clone 36151 3' similar to contains 
LI repetitive element ), R89713 (ym99h07.rl Homo sapiens cDNA clone 167101 5'), Z63670 
(H.sapiens CpG island DNA genomic Msel fragment, clone 89bll, forward read 

2 5 cpg89bll.ftla), and Z82170 (Human DNA sequence from PAC 326L13 containing brain-4 
mRNA ESTs and polymorphic CA repeat). Based upon sequence similarity, da389_l 
proteins and each similar protein or peptide may share at least some activity. The 
nucleotide sequence of da389_l indicates that it may contain a repetitive element. 

30 Clone "dd71 2" 

A polynucleotide of the present invention has been identified as clone M dd71__2". 
dd71_2 was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
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analysis of the amino acid sequence of the encoded protein. dd71_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"dd71_2 protein"). 

The nucleotide sequence of dd71_2 as presently determined is reported in SEQ ID 
5 NO:13. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the dd71_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:14. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
dd71_2 should be approximately 1700 bp. 

1 0 The nucleotide sequence disclosed herein for dd71_2 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. dd71_2 demonstrated at least some similarity with sequences 
identified as AA011156 (ze34h02.rl Soares retina N2b4HR Homo sapiens cDNA clone 
360915 5'), H64206 (EST0047 Homo sapiens cDNA clone HTN-6-41), U40719 (Rattus 

15 norvegicus S-adenosylmeth), and Z31048 (M.musculus expressed sequence tag 
MTEST167). The predicted amino acid sequence disclosed herein for dd71_2 was 
searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol. The predicted dd71_2 protein demonstrated at least some 
similarity to sequences identified as L09190 (trichohyalin [Homo sapiens]). Based upon 

2 0 sequence similarity, dd71_2 proteins and each similar protein or peptide may share at 
least some activity. 



Clone M dm221 1" 

A polynucleotide of the present invention has been identified as clone "dm221_l". 

2 5 dm221_l was isolated from a human adult brain cDNA library using methods which are 

selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. dm221_l is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 

3 0 as M dm221_l protein"). 

The nucleotide sequence of dm221„l as presently determined is reported in SEQ 
ID NO:15. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the dm221__l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:16. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
dm221_l should be approximately 2500 bp. 

The nucleotide sequence disclosed herein for dm221_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
5 FASTA search protocols. dm221_l demonstrated at least some similarity with sequences 
identified as AA117998 (mn06h05.rl Beddington mouse embryonic region Mus musculus 
cDNA clone 537177 5'), AA164251 (zq46c05.sl Stratagene hNT neuron (#937233) Homo 
sapiens cDNA clone 632744 3* similar to contains Alu repetitive element), AA333321 
(EST37403 Embryo, 8 week I Homo sapiens cDNA 5' end), N93607 (zb69gll.sl Soares fetal 

10 lung NbHL19W Homo sapiens cDNA clone 308900 3'), U14568 (***ALU WARNING: 
Human Alu-Sb subfamily consensus sequence), U57007 (Human Ya5 subfamily Alu 
sequence), W20519 (zb26g03.rl Soares fetal lung NbHL19W Homo sapiens cDNA clone 
303220 5'), mtd VV25502 (zb69gll.rl Soares fetal lung NbHL19W Homo sapiens cDNA 
clone 308900 5'). The predicted amino acid sequence disclosed herein for dm221_l was 

15 searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol. The predicted dm221_l protein demonstrated at least some 
similarity to sequences identified as S58722 (X-linked retinopathy protein {C-terminal, 
clone XEH.8c} [human, Peptide Partial, 100 aa] [Homo sapiens]). Based upon sequence 
similarity, dm221_l proteins and each similar protein or peptide may share at least some 

2 0 activity. The nucleotide sequence of dm221_l indicates that it may contain an Alu 
repetitive element. 



Clone M dx279 1" 

A polynucleotide of the present invention has been identified as clone "dx279_l". 

2 5 dx279_l was isolated from a human adult testes cDNA library using methods which are 

selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. dx279_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 

3 0 "dx279_l protein"). 

The nucleotide sequence of dx279_l as presently determined is reported in SEQ 
ID NO:17. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the dx279_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:18. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
dx279_l should be approximately 1300 bp. 

The nucleotide sequence disclosed herein for dx279_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
5 FASTA search protocols. dx279_l demonstrated at least some similarity with sequences 
identified as AA255685 (zs22e05.rl NCI_CGAP_GCB1 Homo sapiens cDNA clone 685952 
5'), R46317 (yj53g03.rl Homo sapiens cDNA clone 152500 5'), and R67743 (yi28d02.rl 
Homo sapiens cDNA clone 140547 5 r ). Based upon sequence similarity, dx279_l proteins 
and each similar protein or peptide may share at least some activity. The TopPredll 
10 computer program predicts a potential transmembrane domain within the dx279_l 
protein sequence centered around amino acid 70 of SEQ ID NO:18. 

Clone ,, em243 1" 

A polynucleotide of the present invention has been identified as clone M gm243_l". 

1 5 gm243_l was isolated from a human adult uterus cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. gm243_l is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 

2 0 as "gm243„l protein"). 

The nucleotide sequence of gm243_l as presently determined is reported in SEQ 
ID NO:19. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the gm243_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:20. 

2 5 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

gm243__l should be approximately 3500 bp. 

The nucleotide sequence disclosed herein for gm243_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. gm243_l demonstrated at least some similarity with sequences 

3 0 identified as H39507 (yo54c09.rl Homo sapiens cDNA clone 181744 5'). Based upon 

sequence similarity, gm243_l proteins and each similar protein or peptide may share at 
least some activity. 
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Deposit of Clones 

Clones bd379_l, bpl21_2, bp646_10, cf50_l, cwl543_3, da389_l, dd71_2, 
dm221_l, dx279_l and gm243_l were deposited on March 13, 1997 with the American 
Type Culture Collection as an original deposit under the Budapest Treaty and were given 
5 the accession number ATCC 98361, from which each clone comprising a particular 
polynucleotide is obtainable. All restrictions on the availability to the public of the 
deposited material will be irrevocably removed upon the granting of the patent, except 
for the requirements specified in 37 C.F.R. § 1.808(b). 

Each clone has been transfected into separate bacterial cells (£. colt) in this 

1 0 composite deposit. Each clone can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5' site, EcoRI; 3' site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pMOTs vector depicted in Fig. 1. The pED6dpc2 vector ("pED6") was derived from 
pED6dpcl by insertion of a new polylinker to facilitate cDNA cloning (Kaufman et al, 

15 1991, Nucleic Acids Res. 19: 4485-4490); the pNOTs vector was derived from pMT2 
(Kaufman et ah, 1989, Moh Cell Biol 9: 946-958) by deletion of the DHFR sequences, 
insertion of a new polylinker, and insertion of the M13 origin of replication in the Clal site. 
In some instances, the deposited clone can become "flipped'" (i.e., in the reverse 
orientation) in the deposited isolate. In such instances, the cDNA insert can still be 

2 0 isolated by digestion with EcoRI and NotI. However, NotI will then produce the 5 1 site 
and EcoRI will produce the 3' site for placement of the cDNA in proper orientation for 
expression in a suitable vector. The cDNA may also be expressed from the vectors in 
which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 

2 5 deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of the 
oligonucleotide probe that was used to isolate each full-length clone is identified below, 

3 0 and should be most reliable in isolating the clone of interest. 



Clone Probe Sequence 

bd379_l SEQ ID NO:21 

bn121 2 SEQIDNO-22 
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dd71_2 



bp646_10 



cf50„l 



cwl543_3 



da389_l 



SEQ ID NO:23 
SEQ ID NO:24 
SEQ ID NO:25 
SEQ ID NO:26 
SEQ ID NO:27 



dx279_l 



dm221_l 



SEQ ID NO:28 
SEQ ID NO:29 



gm243_l 



SEQ ID NO:30 



10 



15 



20 



25 



In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes /primers by a biotinylated phosphoaramidite residue rather than a 
nucleotide (such as , for example, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-2-(N-biotinyl-4-aminobutyl)-propyl-3-0-(2-cyanoethyl)-(N,N- 
diisopropyl)-phosphoramadite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 

(b) It should be designed to have a T m of approx. 80 ° C (assuming 2° for each 
A or T and 4 degrees for each G or C). 

The oligonucleotide should preferably be labeled with g- 32 P ATP (specific activity 6000 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 
label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 pi of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 pg/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
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ug/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 
colonies to nitrocellulose filters and lyse, denature and bake them. 
5 The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 

6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate /liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 ug/ml of yeast RNA, and 10 mM EDTA (approximately 
10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 
a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 

1 0 incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 

1 5 visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 
using standard procedures. The clones can then be verified by restriction analysis, 
hybridization analysis, or DNA sequencing. 
2 0 Fragments of the proteins of the present invention which are capable of exhibiting 

biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 
as described in H.IL Saragovi, et al, Bio/Technology 10, 773-778 (1992) and in R.S. 
McDowell, et al, J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 

2 5 herein by reference. Such fragments may be fused to carrier molecules such as 

immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 
the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 

3 0 be used to generate such fusions. For example, a protein - IgM fusion would generate a 

decavalent form of the protein of the invention. 

The present invention also provides both full-length and mature forms of the 
disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature form 



BNSDOCiD <WO 98d0486A2 ' 



WO 98/40486 PCT/US98/04977 



of such protein may be obtained by expression of the disclosed full-length polynucleotide 
(preferably those deposited with ATCC) in a suitable mammalian cell or other host cell. 
The sequence of the mature form of the protein may also be determinable from the amino 
acid sequence of the full-length form. 
5 The present invention also provides genes corresponding to the cDNA sequences 

disclosed herein. "Corresponding genes" are the regions of the genome that are 
transcribed to produce the mRNAs from which the cDN A sequences are derived and any 
contiguous regions of the genome necessary for the regulated expression of such genes, 
including but not limited to coding sequences, 5' and 3' untranslated regions, alternatively 

10 spliced exons, introns, promoters, enhancers, and silencer or suppressor elements. The 
corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes 
or primers from the disclosed sequence information for identification and /or 
amplification of genes in appropriate genomic libraries or other sources of genomic 

15 materials. 

Where the protein of the present invention is membrane-bound (e.g., is a receptor), 
the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
2 0 transmembrane domains of proteins of the invention can be identified in accordance with 
known techniques for determination of such domains from sequence information. 

Proteins and protein fragments of the present invention include proteins with 
amino acid sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 

2 5 identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 

identity) with that disclosed protein, where sequence identity is determined by comparing 
the amino acid sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 

3 0 (more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 

shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

Species homologs of the disclosed polynucleotides and proteins are also provided 
by the present invention. As used herein, a "species homologue" is a protein or 
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polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide, as determined by those of skill in the art. Species homologs may be 
isolated and identified by making suitable probes or primers from the sequences provided 
5 herein and screening a suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous, or related to that encoded 
by the polynucleotides . 

1 0 The invention also includes polynucleotides with sequences complementary to 

those of the polynucleotides disclosed herein. 

The present invention also includes polynucleotides capable of hybridizing 
under reduced stringency conditions, more preferably stringent conditions, and most 
preferably highly stringent conditions, to polynucleotides described herein. Examples of 

1 5 stringency conditions are shown in the table below: highly stringent conditions are those 
that are at least as stringent as, for example, conditions A-F; stringent conditions are at 
least as stringent as, for example, conditions G-L; and reduced stringency conditions are 
at least as stringent as, for example, conditions M-R. 
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Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)* 


Hybridization Temperature and 
Buffer f 


Wash 

Temperature 
and Buffer' 


A 


DNA:DNA 


> 50 


65 °C; lxSSC-or- 

42°C; lxSSC, 50% formamide 


65°C; 0.3xSSC 


B 


DNA;DNA 


<50 


T B *;lxSSC 


Tg ~, 1 xSSC 


C 


DNA:RNA 


;> 50 


67°C; lxSSC-or- 

45°C; lxSSC, 50% formamide 


67° C; 0.3xSSC 


D 


DNA:RNA 


<50 


T D *; lxSSC 


T D *; lxSSC 


E 


RNA:RNA 


;> 50 


70°C; lxSSC-or- 

50°C; lxSSC, 50% formamide 


70° C; 0.3xSSC 


F 


RNA:RNA 


<50 


Tj:*; lxSSC 


T F *; lxSSC 


G 


DNA:DNA 


* 50 


65 °C; 4xSSC-or- 

42°C; 4xSSC, 50% formamide 


65°C; lxSSC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA:RNA 


a 50 


67°C; 4xSSC -or- 

45 °C; 4xSSC, 50% formamide 


67°C; lxSSC 


J 


DNAiRNA 


<50 


T,*; 4xSSC 


T,*; 4xSSC 


K 


RNA:RNA 


i 50 


70°C; 4xSSC -or- 

50°C; 4xSSC, 50% formamide 


67°C; lxSSC 


L 


RNA:RNA 


<50 


T L *; 2xSSC 


T L *; 2xSSC 


M 


DNA:DNA 


i 50 


50°C; 4xS5C -or- 

40°C; 6xSSC, 50% formamide 


50 °C; 2xSSC 


N 


DNA:DNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


O 


DNA:RNA 


a 50 


55°C; 4xSSC-or- 

42°C; 6xSSC, 50% formamide 


55 °C; 2xSSC 


P 


DNAiRNA 


<50 


T r *; 6xSSC 


T P *; 6xSSC 


Q 


RNA:RNA 


z. 50 


60°C; 4xSSC-or- 

45°C; 6xSSC, 50% formamide 


60 °C; 2xSSC 


R 


RNA:RNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



*: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 

2 5 hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 

or regions of optimal sequence complementarity. 

f : SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 P0 4/ and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(lxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

3 0 *T B - T R : The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 

be 5-1 0°C less than the melting temperature (T m ) of the hybrid, where T m is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T m (°C) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, T m (°C) = 81.5 + 16.6(log 10 [Na + ]) + 0.41(%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na + ] is the concentration of sodium ions in the 
3 5 hybridization buffer ([Na + ] for lxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., E.R Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et al., eds., 
5 John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
1 0 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity while minimizing 
seauence eaos. 

The isolated polynucleotide of the invention may be operably linked to an 
15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 
2 0 linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/ expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 

2 5 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 

Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

3 0 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis r Salmonella typhimurium. or any bacterial 

33 



BNSDOCiD <WO 9840486A2 I > 



WO 98/40486 PCT7US98/04977 




strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
5 enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
10 e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

15 The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 

2 0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chroma tograp hy . 

2 5 Alternatively, the protein of the invention may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 

3 0 InVitrogen, respectively. The protein can also be tagged with an epitope and 

subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
5 with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

10 The protein may also be produced by known conventional chemical synthesis. 

Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and/or conformational 
characteristics with proteins may possess biological properties in common therewith, 

15 including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 

2 0 provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 

2 5 amino acid to alter the conformation of the molecule. Techniques for such alteration, 

substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 

3 0 expected to retain protein activity in whole or in part and may thus be useful for screening 

or other immunological methodologies may also be easily made by those skilled in the art 
given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 
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USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
5 invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

1 0 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 

15 states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 
with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 

2 0 known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 

2 5 binds to another protein (such as, for example, in a receptor-ligand interaction), the 

polynucleotide can also be used in interaction trap assays (such as, for example, that 
described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding 
interaction. 

3 0 The proteins provided by the present invention can similarly be used in assay to 

determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
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the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
5 protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent 
1 0 grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory Press, Sambrook, 
J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 
1 5 Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 
2 0 or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
capsules. In the case of microorganisms, the protein or polynucleotide of the invention 

2 5 can be added to the medium in or on which the microorganism is cultured. 

Cytokine and Cell Proliferation/Differentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 
inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 

3 0 induce production of other cytokines in certain cell populations. Many protein factors 

discovered to date, including all known cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of a protein of the present invention is 
evidenced by any one of a number of routine factor dependent cell proliferation assays 
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for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, 
MC9/G, M+ (preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J. 
Immunol. 152: 1756-1761, 1994. 

Assays for cytokine production and /or proliferation of spleen cells, lymph node 
cells or thymocytes include, without limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology, J.E.e.a. 
Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 
Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 
Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 
336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 
Interleukin 11 - Bennett, F., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 
in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 
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Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
5 E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., 
Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et ah, Eur. J. Immun. 
11:405-411, 1981; Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai et aL, J. Immunol. 
10 140:508-512, 1988. 



Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 
15 are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., 
in regulating (up or down) growth and proliferation of T and /or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HTV) as well as bacterial or fungal 

2 0 infections, or may result from autoimmune disorders. More specifically, infectious 

diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 
25 be useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 

3 0 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 

myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 
Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to immune responses, 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an 
immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves 
inducing non-responsiveness or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft- versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 
the transplant. The administration of a molecule which inhibits or blocks interaction of 
a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomelic form of a peptide having B7-2 activity alone or in conjunction with a 
monomeric form of a peptide having an activity of another B lymphocyte antigen {e.g., B7- 
1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
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tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
5 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al, Science 257:789-792 (1992) and Turka et al, Proc. Natl. Acad. 
Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
10 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 

1 5 activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptonligand interactions of B lymphocyte antigens can be used to inhibit T cell 

2 0 activation and prevent production of autoantibodies or T cell-derived cytokines which 
may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a number of well-characterized animal models of 

2 5 human autoimmune diseases. Examples include murine experimental autoimmune 

encephalitis, systemic lupus erythmatosis in MKL/lpr/lpr mice or NZB hybrid mice, 
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 840-856). 

3 0 Upregulation of an antigen function (preferably a B lymphocyte antigen function), 

as a means of up regulating immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of enhancing an existing immune 
response or eliciting an initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
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viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
5 by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 

1 0 acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 

15 (preferably B lymphocyte antigen function) may be useful in the induction of tumor 
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present 
invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 

2 0 For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 
transfected tumor cells are returned to the patient to result in expression of the peptides 
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

2 5 to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary 
costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 

3 0 molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II 

molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I a chain protein and P 2 
microglobulin protein or an MHC class II a chain protein and an MHC class II P chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
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Expression of the appropriate class I or class II MHC in conjunction with a peptide having 
the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
5 as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 
immune response in a human subject may be sufficient to overcome tumor-specific 
tolerance in the subject. 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 

limitation, those described in: Current Protocols in Tmirmnology, Ed by J. E. Coligan, A.M. 

Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates 
15 and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 

3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. 

USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., 

J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., 

J. Immunol. 140:508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 
2 0 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al, J. Immunol. 

135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. 

Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., 

Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 

2 5 (which will identify, among others, proteins that modulate T-cell dependent antibody 

responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J J. and Brunswick, M. In Current Protocols in Immunology. 
J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

3 0 Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 

proteins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by }. E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
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7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai 
et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
5 described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995; 
Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965, 
1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 

10 et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 

15 et aL, Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, 
Journal of Immunology 145:4037-4045, 1990; Zamai et aL, Cytometry 14:891-897, 1993; 
Gorczyca et aL, International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 

2 0 development include, without limitation, those described in: Antica et aL, Blood 
84:111-117, 1994; Fine et aL, Cellular Immunology 155:111-122, 1994; Galy et aL, Blood 
85:2770-2778, 1995; Toki et aL, Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 



Hematopoiesis Regulatine Activity 

25 A protein of the present invention may be useful in regulation of hematopoiesis 

and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 
lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor cells alone or in combination with other cytokines, 

3 0 thereby indicating utility, for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the production of erythroid 
precursors and/ or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as granulocytes and monocytes /macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
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myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and/or in supporting the growth and proliferation of 
5 hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 

10 ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

15 Suitable assays for proliferation and differentiation of various hematopoietic lines 

are cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
2 0 al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 

2 5 Hematopoietic Cells, R.I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 

NY. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994; Neben et al., Experimental Hematology 22:353-359, 

3 0 1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic 

Cells. R.I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture initiating cpll assay, Sutherland, 
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HJ. In Culture of Hematopoietic Cells. R.L Freshney, et al eds. Vol pp. 139-162, Wiley-Liss, 
Inc., New York, NY. 1994. 

Tissue Growth Activity 

A protein of the present invention also may have utility in compositions used for 
bone, cartilage, tendon, ligament and /or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of burns, 
incisions and ulcers. 

A protein of the present invention, which induces cartilage and /or bone growth 
in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
to attract bone-forming cells, stimulate growth of bone-forming cells or induce 
differentiation of progenitors of bone-forming cells. A protein of the invention may also 
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and /or cartilage repair or by blocking inflammation or processes of tissue 
destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory 



Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon /ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and 
other animals. Such a preparation employing a tendon/ligament-like tissue inducing 
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon /ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
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congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and /or sequestering agent as a carrier as is well known in 
the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 



traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present 
invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) 
and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 
comprising such tissues. Part of the desired effects may be by inhibition or modulation 
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 
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A protein of the present invention may alsooe useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
5 differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 



10 in: International Patent Publication No. W095/ 16035 (bone, cartilage, tendon); 

International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 

Publication No. WO91/07491 (skin, endothelium ). 

Assays for wound healing activity include, without limitation, those described in: 

Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.), Year 
15 Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 

Dermatol 71:382-84 (1978). 



2 0 activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 

2 5 and decrease spermatogenesis in male mammals. Administration of sufficient amounts 

of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin- 
P group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 

3 0 United States Patent 4,798,885. A protein of the invention may also be useful for 

advancement of the onset of fertility in sexually immature mammals, so as to increase the 
lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 



Assays for tissue generation activity include, without limitation, those described 



Activin /Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
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Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 

Chemo tactic /Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
(e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/ or endothelial cells. 
Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advantages in treatment of wounds and other trauma to tissues, as well as in 
treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors or sites of infection may result in improved immune responses 
against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 
of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration 
of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 
beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 
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Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 
As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
5 and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et aL, Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

15 35:467-474, 1988. 



Receptor /Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 
2 0 such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 

2 5 development of cellular and humoral immune responses). Receptors and ligands are also 

useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

3 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in:Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W.Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et aL, Cell 80:661-670, 
5 1995. 



Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 

1 0 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 

15 inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 

2 0 from over production of cytokines such as TNF or IL-1 . Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 



Cadherin /Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 
25 roles during development, particularly in defining specific cell types. Loss or alteration 
of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
diseases, such as pemphigus vulgaris and pemphigus foliaceus (auto-immune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 
3 0 The cadherin superfamily includes well over forty members, each with a distinct 

pattern of expression. All members of the superfamily have in common conserved 
extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
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first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 
5 E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 

types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 

10 their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 

15 encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 

2 0 in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 

2 5 polynucleotides of the present invention encoding such proteins, can used to generate 

antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 

3 0 cancer, the less cadherin expression there will be, and this decrease in cadherin expression 

can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
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to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
5 encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 



10 Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 

15 tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth. 



2 0 Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 

2 5 weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 

3 0 carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 

effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
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lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
5 in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 



without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutical^ acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 

15 "pharmaceutical^ acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 

2 0 IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 
cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and /or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 

2 5 or to minimize side effects. Conversely, protein of the present invention may be included 

in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

3 0 A protein of the present invention may be active in multimers (e.g., heterodimers 

or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 



ADMINISTRATION AND DOSING 



10 



A protein of the present invention (from whatever source derived, including 
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The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
and /or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
5 lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
1 0 can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
15 in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 
2 0 and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective amount" means the total 

2 5 amount of each active component of the pharmaceutical composition or method that is 

sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 

3 0 a combination, the term refers to combined amounts of the active ingredients that result 

in the therapeutic effect, whether administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
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administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
5 administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic 
1 0 factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 

1 5 Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 

2 0 an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 

2 5 physiological saline solution, dextrose or other saccharide solution, or glycols such as 

ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 
of the present invention, and preferably from about 1 to 50% protein of the present 
invention. 

3 0 When a therapeutically effective amount of protein of the present invention is 

administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
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pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringers Injection, or other vehicle as known in the art. The 
5 pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

The amount of protein of the present invention in the pharmaceutical composition 
of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 

10 Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 
will administer low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 

1 5 increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 ug to about 100 
mg (preferably about O.lng to about 10 mg, more preferably about 0.1 ug to about 1 mg) 
of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 

2 0 present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 
will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 

2 5 therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additionally may contain a cysteine residue at the 

3 0 carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 

(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 
R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 
211, 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
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antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
5 and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 

1 0 administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 

15 optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and /or cartilage formation, the 
composition would include a matrix capable of delivering the protein-containing 
composition to the site of bone and/or cartilage damage, providing a structure for the 

2 0 developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presently in use for other implanted medical 
applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 

2 5 application of the compositions will define the appropriate formulation. Potential 

matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 

3 0 or extracellular matrix components. Other potential matrices are nonbiodegradable and 

chemically defined, such as sintered hydroxapatite, bioglass, aluminates, or other 
ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium- 
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aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
5 In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions 
from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
10 ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 

A ns^Aitim I /-»■*-»-» -» f i-k r^rvlir/ /-k+V-k x rl <~» rrltrr*^^ r»Alur\VlfofV>irlor\D AVI r\ O /^^ rV^O Yir^riT^A/l 

polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 
15 wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 
2 0 In further compositions, proteins of the invention may be combined with other 

agents beneficial to the treatment of the bone and /or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-oc and 
TGF-P), and insulin-like growth factor (IGF). 

2 5 The therapeutic compositions are also presently valuable for veterinary 

applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue regeneration will be determined by the attending physician considering 

3 0 various factors which modify the action of the proteins, e.g., amount of tissue weight 

desired to be formed, the site of damage, the condition of the damaged tissue, the size of 
a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration and other clinical factors. The dosage may vary 
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the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect 
the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and /or repair, for example, X-rays, histomorphometric determinations and tetracycline 
5 labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
10 limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic purposes. 

15 Patent and literature references cited herein are incorporated by reference as if 

fully set forth. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT : Jacobs , Kenneth 



McCoy, John M. 
LaVallie, Edward R. 
Racie, Lisa A. 
Merberg, David 
Treacy, Maurice 
Spaulding , Vikki 
Agostino, Michael J. 



(ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 



(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Genetics Institute , Inc . 

(B) STREET: 87 Cambr idgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Sprunger , Suzanne A. 

(B) REGISTRATION NUMBER: 41,32 3 

(ix) TELECOMMUNICATION INFORMATION: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



ENCODING THEM 



(A) TELEPHONE: (617) 498-8284 

(B) TELEFAX: (617) 876-5851 
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MOLECULE TYPE: 



cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GTTACCTCTC TGTAGCTATG TTTTCCCATG GTTGTTTTAA GAGGAATAGA AGAAAGGAAA 6 0 

ACAGCTTGTG AAAATCCTAA GTTGTATTTG AACGAGCAAG CTGTGTTTCC T CAT T AAAAC 12 0 

ATTTATTTCG CATTTGATGG TCCATAACTG CCCATTTACC TCAGGATGCC TCCATATGAT 18 0 

GAAAATAAGA ACAGAGTTGA AAGAAGTCTC CATAAACACA ACGCACATTG GCAAATGTCA 24 0 

TATTCTTGTT CCTTAAGGGA T T AG AG AAC A CTTTCTTCTC TTTGTCTTTG CCCCCAAAGT 30 0 

AAAAGCTATA AGCTTTTATA ATTAAATAAT AAGACTGAAT AACCATAAGC GCAAATAATA 3 60 

TGTAGTATTA TGAGAAATAC TGGGAAAAAG GACACTTACT GTGTGACTTA AATTGATTAA 42 0 

AGGGTTATTC AGTTCAACTC TCTTGAATCT AATTAGTATT TTTGTGTCAT TTATTATTAT 4 80 

AGGGCACACA TTTTTTACAT TTGATTTAAC TTGACCAAAA TTAAATGAGC AAATGTTTAT 540 

TGCTATGTCC ATTGTTTTCC TTTCTCTGTC ACTGTTAAAA AGAGGAGCCA TGGCTTCTGC 60 0 

TTCTTCTGTG TATTCTCCAT TAGACCTTCT TCATCCACCC TCTTCCCCAT CCCTTTCAGC 66 0 

TCTGAAGGGT CTATAAATGA AAGTGGGTAC CAACTGATTC AATAGGACTT ATATCTTACC 72 0 

AAATAACGTT TTATTGTCTT TGTTCTATGT ATTTGCAGAG AAATTGTAAG TATCTTTAAA 7 80 

ACCAATTAAC AAAGCCCTGT GGGTCTTTCA ATCAAGACCT TTGTAAACAT CTCTACTAGC 84 0 

CCATACTCCC CCAAACTTCT TGCACATGGT AGAAGATGAC ATTAAATAAA GCACATTATA 90 0 

AGGTGCAATG AGCTTTATTC TAAAAATATT GTCTGGATGT GAAAGTAAGT TCTTGTTCAT 9 60 

AAAATGTTAT TAGTAAAATG TTATTAGATT AAAATTATGG AGTAAGCATT TGGCAAACTG 102 0 

ATTGACTCTT CACTGGAAAG ACCAGGCTTT TTAGGACACA TTTCTGTTCA TGCTTAAGGT 108 0 

CAGAAGTCAA TCAAAGGCAA CCAAAAAAAA AAAAAAA 1117 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Phe lie Ala Met Ser lie Val Phe Leu Ser Leu Ser Leu Leu Lys 
15 10 15 

Arg Gly Ala Met Ala Ser Ala Ser Ser Val Tyr Ser Pro Leu Asp Leu 
20 25 30 

Leu His Pro Pro Ser Ser Pro Ser Leu Ser Ala Leu Lys Gly Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GTGGAGAGGA GGTGGCAGCG GCCCGGGAGG CCGGAGCCAA GCCAGCGACC CACCATGGAG 6 0 

ACCCGCTACA ACCTGAAGAG TCCGGCTGTT AAACGTTTAA TGAAAGAAGC GGCAGAATTG 120 

AAAGATCCAA CAGATCATTA CCATGCGCAG CCTTTAGAGG ATAACCTTTT TGAATGGCAC 180 

TTCACGGTTA GAGGGCCCCC AGACTCCGAT TTTGATGGAG GAGTTTATCA CGGGCGGATA 240 

GTACTGCCAC CAGAGTATCC CATGAAACCA CCAAGCATTA TTCTCCTAAC GGCTAATGGT 300 

CGATTTGAAG TGGGCAAGAA AATCTGTTTG AGCATCTCAG GCCATCATCC TGAAACTTGG 3 60 

CAGCCTTCGT GGAGTATAAG GACAGCATTA TTAGCCATCA TTGGGTTTAT GCCAACAAAA 42 0 

GGAGAGGGAG CCATAGGTTC TC T AG AT AAC ACTCCTGAGG AAAGAAGAGC ACTTGCCAAA 48 0 

AAATCACAAG ATTTCTGTTG TGAAGGATGT GGCTCTGCCA TGAAGGATGT CCTGTTGCCT 54 0 

TTAAAATCTG GAAGCGATTC AAGCCAAGCT GACCAAGAAG CCAAAGAACT GGCTAGGCAA 600 

ATAAGCTTTA AGGCAGAAGT CAATTCATCT GGAAACACTA TCTCTGAGTC AG AC TTAAAC 660 

CACTCTTTTT CACTAACTGA TTTACAAGAT GAT AT AC C T A CAACATTCCA GGGTGCTACG 72 0 

GCCAGTACAT CGTACGGACT CCAGAATTCC TCAGCAGCAT CCTTTCATCA ACCTACCCAA 78 0 

CCTGTAGCTA AGAATACCTC CATGAGCCCT CGACAGCGCC GGGCCCAGCA GCAGAGTCAG 84 0 
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AGAAGGTTGT CTACTTCACC 
GATCATGGTG GGTCAGCTGT 
TTCCGACGAA TATATCTGGC 
GTGACTTATG AGCTGTGACT 
GAATTGTTTA CAAAAAGATT 
AGTTTAGTAT AAATATATCT 
GGGACTAGAA GTCAAAGTTC 
GTTTTTTCTC ATGTTTCATC 
TTTGAAATTT TTGGATAAAA 
ATACCACTTA TTGAGTACTC 
AAAAGGGAAA GCAAAACTCC 
TGGAATTGCT CATTCTGTGA 
TATTTCAGTC AAACATTTAT 
AAGAGGGAGT ACTTTGTAAC 
GCCATAATCA CACAAATTTA 
AAGGGCCCAA GCCAGGAGAT 
GATTTGAATG TTTGGTCTGC 
TTTATAATCA GTATAAATTT 
GTCCTATTTT AATAGAAAAC 
TTACTTAGAA TTGTCCACCT 
ATATAACTTA GTGCTAGCTA 
CTGGTGAAAC C AT AT AATC A 
GAGGGCACTC TTGAGGGTGT 
GCTTTCAAAT AGGTTATGCT 
ACAATGTTGG AAGTGGTATT 
ATCTTGAATA GTATAGGGGA 
ATTATGGACA AGAGAAATAA 
GTTTATGGGA CATTCGTTCT 
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AGATGTAATC CAGGGCCACC AGCCAAGAGA CAACCACACT 900 

ACTGATTGTC ATCCTGACTT TGGCATTGGC AGCTCTTATA 96 0 

AAACGAATAC ATATTTGACT TTGAGTTATA ATATGGTTTT 102 0 

CAACTGCTTC ATTAAACATT CTGCATTGGG TATAATCTAA 10 80 

ATTTTGTATT TACCCTTCAT TCCTTTTTTT GATCCTTGTA 1140 

AGACATTCAG ACTGTGTCTA GCAGTTACGT CCTGCTTAAA 12 00 

CTTGTCTCAC TATTTGATCT GCTTTGCAGG GAAATAACTT 12 6 0 

TTCTTTTTAT GTAAATTTGT AATACTTTCC TATATTGCCC 13 2 0 

GATGATGTTT TAAGTTCCAA TGAGTATTAC TAGTTACTCA 13 8 0 

TGTTTCTACG TATGTAGAAT GTATAGGGAT AGAAGAGTTG 1440 

TCAAGTAGCT TCCTTAAAAT GTC ATT CAT A GGAGATGTAC 15 00 

CTTTATTTGT GTCCTAAACA TTCTTCAGTG AAAATAATTT 15 60 

GAGGAAATGA GATCACATCT TTGTCACTGG ATGCTACTTG 162 0 

CACTTTGATA TGCTGTTATC ACCACCCCCT GCCCTCTGCT 16 80 

AAAAGAAAGA AAACAGTCTT CCATAGATTT TTAAGGAAGA 17 40 

CGCTTGGTTT TCTTCCAGAA GTTAAATGGG GGGATCTGAA 18 00 

TTTGAAATGT ATGTCTTTTG GGATGTATTA TATGCCTAGC 1860 

TAATTATTCC AGGAATATGC ATAATATTGA AATATTTCAT 19 2 0 

CTCAGGGCCC AAGTAACAGT GATAGAAGTT AG AAAAAC C T 19 8 0 

AGTCAGAGCC CAAGAAAGAA TTTTCAGTGG AAAAATCAAT 204 0 

GCGCCACAGA CTCTAGTAGA TAATATTATC ATCATAATGG 2100 

CAGAAAAACA TTGCCTTCAG CATGTTCAGT TCGCAGCACT 2160 

TGTTAATGAA GATTTAATTT TTAAATACAG GTGGTTCCAA 222 0 

CCAAAAGTGT TATTTGTAAG TTAATTTTTT TACAAGTCAA 22 80 

TAGGTTCTAG ATCGGTCCAC GAAAGTTAGC CCATATGTAT 2340 

GGGTATTCAT AAAGTCCTTA TGTGGTTTTA ACTAAGTGAA 2 40 0 

TTGTAAAATC GTCTTAAAGG AAAATTTAAT TTTTACTCCT 2460 

ATTAACTGTC AGACACAATT TCTGTTTTCA TCTGAGAGCC 2 52 0 
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AGTTTTCCTT TATTTCTACA TCTAAAATAA GAAC ATATTG TACATTATTA TATAATACAG 2 58 0 

AATTGTCTTA AACTTTAATA AATTCGCATT TTAAAGGTGT TTACAGATTA TTTTTTATAT 2 64 0 

CTGTAGCTGA ATTTGTTAAA GTCTAAAAAG CTCAAGGACT TTATGAAGAT CTCATTATAT 27 00 

GAGGAAAATC ATAGGTTACC ATTTTATAAC TCTATTGCCA TAAGAAAATA CACTCTAAAA 27 6 0 

TCTTGATTTG AAACATATTA GAAACCTTG A TTCAGTGCTC AGTGGTCTCC TAGTAAGAAG 282 0 

TCACCGACGG TAGCGTCATA TGAGAAGAAA GAAATCCCCA CCACCTCAAC CTCTGCTGAG 28 80 

ATTGTGTGCT AGGAACAGCC TTCCCTCCGT TTCCCCTCAG TCAAACTTGA GCCAGCCTCT 29 40 

GGATCGATGT GATCTTATTG C ATGTTTC C A TGGGGTGTAC CTATACTTTA AGCCAATCCT 3 0 00 

GCTGCATTCA CTGCTAAGTT AAATAAAAAG CCAAGAAGAT TTTGCACTGT GCAGATCCTT 3 060 

TGCTATCTGA CTTGCATCTC TTCCCCCACC TGTCAGCTAG CCACCTGCTT GTTTGTGTTG 312 0 

GGATATTTTT T AG C AC C TG A AGCACCATCT GAAAGGGGCA CCATTTTCTT CTTC^TTTC 318 0 

ATCTCACATA TGCTCCCTAA AAATCCTTAA GTTGTCAATC TGATCCCCAG TGTGAGGTTA 3240 

ATGAGCAAAA TTGGTCTTTG GGGCCCTTTT TGTCCAAGCC CCACTGAAAG GCCTCTTCAG 33 00 

AAAACTATTA TCTTTAAAGC CCTACTTTAA CTCCTTAATT CCAGCATACA GCTAAAACTG 33 60 

GATGTATATT CTGGCAAGTA AAGGCTGAGG ACTCCTCTTT AATCCTGAGA TCTAGATAAC 342 0 

TCATGACATT TTATTTGACC AACATAGCAC ATGATGAGAT ATCAAGGTAA TTAAAATAGC 348 0 

ATGCTTGAAA AAAAAATACG TAATCTGTTT CACCTGTAAC TGTTTAAGCC AATAAACTTT 3 540 

TCAAAATTTA TGTAATGTGG GGCTTTTATG TAGCACTTTA CGTTTTCATG CTGC TTATTG 3 600 

TTTTATTCTA CTGAAAAAAA TGAATTTCAA GATTCTCAAC TTTTTTAATT TCAAAAATTG 3 66 0 

TTTATTGTTT TGACTATAGG AATACAAAAT TTCCTATTTT GGGAGAATAA GAACTCTTTT 372 0 

TGTCATTTTT GGCTATGAAT AAACTTTCTG GTCTTTTGAG ACCACCCATT TTTATAGATC 3 7 80 

AGAATCAGAA AACAGGTAAA CCTCACTCAC ACATTTGGAC TCATTTGAAC AAAAATCTAG 3 84 0 

GCCAAAATAC TGAAAAGCCT ATGTGTTTTT TTAATTGGAA GTATATGTAA GGTTAATGCA 3 9 00 

TTTAGTGAAC GTG AC TAAC A AAGACTAATG TGCACATTAA CAGATGTACT TTTTAAGGTT 3960 

TTATGGGAGG CTGTGCATTG CTCAAAAGCT GTTGGGAACG CCTTCTGAAC AGTTGCCTTC 402 0 

AGAACTAGTT TGAGCTGCTC AATAAAACCA GTGACTTTAC TCATAAAAAA AAAAAAAA 407 8 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Glu Thr Arg Tyr Asn Leu Lys Ser Pro Ala Val Lys Arg Leu Met 

15 10 15 

Lys Glu Ala Ala Glu Leu Lys Asp Pro Thr Asp His Tyr His Ala Gin 
20 25 30 

Pro Leu Glu Asp Asn Leu Phe Glu Trp His Phe Thr Val Arg Gly Pro 
35 40 45 

Pro Asp Ser Asp Phe Asp Gly Gly Val Tyr His Gly Arg lie Val Leu 
50 55 60 

Pro Pro Glu Tyr Pro Met Lys Pro Pro Ser lie lie Leu Leu Thr Ala 
65 70 75 80 

Asn Gly Arg Phe Glu Val Gly Lys Lys lie Cys Leu Ser lie Ser Gly 
85 90 95 

His His Pro Glu Thr Trp Gin Pro Ser Trp Ser lie Arg Thr Ala Leu 
100 105 110 

Leu Ala lie lie Gly Phe Met Pro Thr Lys Gly Glu Gly Ala lie Gly 
115 120 125 

Ser Leu Asp Asn Thr Pro Glu Glu Arg Arg Ala Leu Ala Lys Lys Ser 
130 135 140 

Gin Asp Phe Cys Cys Glu Gly Cys Gly Ser Ala Met Lys Asp Val Leu 
145 150 155 160 

Leu Pro Leu Lys Ser Gly Ser Asp Ser Ser Gin Ala Asp Gin Glu Ala 
165 170 175 

Lys Glu Leu Ala Arg Gin lie Ser Phe Lys Ala Glu Val Asn Ser Ser 
180 185 190 

Gly Asn Thr lie Ser Glu Ser Asp Leu Asn His Ser Phe Ser Leu Thr 
195 200 205 

Asp Leu Gin Asp Asp lie Pro Thr Thr Phe Gin Gly Ala Thr Ala Ser 
210 215 220 

Thr Ser Tyr Gly Leu Gin Asn Ser Ser Ala Ala Ser Phe His Gin Pro 
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225 230 235 240 

Thr Gin Pro Val Ala Lys Asn Thr Ser Met Ser Pro Arg Gin Arg Arg 
245 250 255 

Ala Gin Gin Gin Ser Gin Arg Arg Leu Ser Thr Ser Pro Asp Val lie 
260 265 270 

Gin Gly His Gin Pro Arg Asp Asn His Thr Asp His Gly Gly Ser Ala 
275 280 285 

Val Leu lie Val lie Leu Thr Leu Ala Leu Ala Ala Leu lie Phe Arg 
290 295 300 

Arg lie Tyr Leu Ala Asn Glu Tyr lie Phe Asp Phe Glu Leu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1866 base pairs 

(B) TYPE : nucleic acid 

( C ) STRANDEDNESS : doubl e 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GGCGAGTTCC GGATCCCTGC CTAGCGCGGC CCAACCTTTA CTCCAGAGAT CATGGCTGCC 60 

GAGGATGTGG TGGCG AC TGG CGCCGACCCA AGCGATCTGG AGAGCGGCGG GCTGCTGCAT 12 0 

GAGATTTTCA CGTCGCCGCT CAACCTGCTG CTGCTTGGCC TCTGCATCTT CCTGCTCTAC 18 0 

AAGATCGTGC GCGGGGACCA GCCGGCGGCC AGCGGCGACA GCGACGACGA CGAGCCGCCC 24 0 

CCTCTGCCCC GCCTCAAGCG GCGCGACTTC ACCCCCGCCG AGCTGCGGCG CTTCGACGGC 3 00 

GTCCAGGACC CGCGCATACT CATGGCCATC AACGGCAAGG TGTTCGATGT GACCAAAGGC 3 60 

CGCAAATTCT ACSGGCCCGA RGGGCCGTAT GGGGTCTTTG CTGGAAGAGA TGCATCCAGG 42 0 

GGCCTTGCCA CATTTTGCCT GGATAAGGAA GCACTGAAGG ATGAGTACGA TGACCTTTCT 480 

GACCTCACTG CTGCCCAGCA GGAGACTCTG AGTGACTGGG AGTCTCAGTT CACTTTCAAG 54 0 

TATCATCACG TGGGCAAACT GCTGAAGGAG GGGGAGGAGC CCACTGTGTA CTCAGATGAG 60 0 

GAAGAACCAA AAGATGAGAG TGCCCGGAAA AATGATTAAA GC ATTCAGTG GAAGTATATC 66 0 

TATTTTTGTA TTTTGCAAAA TCATTTGTAA CAGTCCACTC TGTCTTTAAA ACATAGTGAT 72 0 
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TACAATATTT AGAAAGTTTT GAGCACTTGC TATAAGTTTT TTAATTAACA TCACTAGTGA 7 80 

C AC T AAT AAA ATTAACTTCT TAGAATGCAT GATGTGTTTG TGTGTCACAA ATCCAGAAAG 84 0 

TGAACTGCAG TGCTGTAATA CACATGTTAA TACTGTTTTT CTTCTATCTG TAGTTAGTAC 9 00 

AGGATGAATT TAAATGTGTT TTTCCTGAGA GACAAGGAAG ACTTGGGTAT TTCCCAAAAC 96 0 

AGGTAAAAAT CTTAAATGTG CACCAAGAGC AAAGGATCAA CTTTTAGTCA TGATGTTCTG 102 0 

TAAAGACAAC AAATCCCTTT TTTTTTCTCA ATTGACTTAA CTGCATGATT TCTGTTTTAT 10 8 0 

CTACCTCTAA AGCAAATCTG CAGTGTTCCA AAGACTTTGG TATGGATTAA GCGCTGTCCA 114 0 

GTAACAAAAT GAAATCTCAA AACAGAGCTC AGCTGCAAAA AAGCATATTT TCTGTGTTTC 12 00 

TGGACTGCAC TGTTGTCCTT GCCCTCACAT AGACACTCAG ACACCCTCAC AAACACAGTA 12 6 0 

GTCTATAGTT AGGATTAAAA TAGGATCTGA ACATTCAAAA GAAAGCTTTG GAAAAAAAGA 13 2 0 

GCTGGCTGGC CTAAAAACCT AAATATATGA TGAAGATTGT AGGACTGTCT TCCCAAGCCC 13 8 0 

CATGTTCATG GTGGGGCAAT GGTTATTTGG TTATTTTACT CAATTGGTTA CTCTCATTTG 144 0 

AAATGAGGGA GGGACATACA GAATAGGAAC AGGTGTTTGC TCTCCTAAGA GCCTTCATGC 15 0 0 

ACACCCCTGA ACCACGAGGA AACAGTACAG TCGCTAGTCA AGTGGTTTTT AAAGTAAAGT 15 6 0 

ATATTCATAA GGTAACAGTT ATTCTGTTGT TATAAAACTA TACCCACTGC AAAAGTAGTA 162 0 

GTCAAGTGTC TAGGTCTTTG ATATTGCTCT TTTGGTTAAC ACTAAGCTTA AGTAGACTAT 16 80 

ACAGTTGTAT GAATTTGTAA AAGTATATGA ACACCTAGTG AGATTTCAAA CTTGTAATTG 17 40 

TGGTTAAATA GTCATTGTAT TTTCTTGTGA ACTGTGTTTT ATGATTTTAC CTCAAATCAG 18 0 0 

AAAACAAAAT GATGTGCTTT GGTCAGTTAA TAAAAATGGT TTTACCCACT AAAAAAAAAA 18 6 0 

AAAAAAAA 1868 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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Met Ala Ala Glu Asp Val Val Ala Thr Gly Ala Asp Pro Ser Asp Leu 
15 10 15 

Glu Ser Gly Gly Leu Leu His Glu lie Phe Thr Ser Pro Leu Asn Leu 
20 25 30 

Leu Leu Leu Gly Leu Cys lie Phe Leu Leu Tyr Lys lie Val Arg Gly 
35 40 45 

Asp Gin Pro Ala Ala Ser Gly Asp Ser Asp Asp Asp Glu Pro Pro Pro 
50 55 60 

Leu Pro Arg Leu Lys Arg Arg Asp Phe Thr Pro Ala Glu Leu Arg Arg 
65 70 75 80 

Phe Asp Gly Val Gin Asp Pro Arg lie Leu Met Ala He Asn Gly Lys 
85 90 95 

Val Phe Asp Val Thr Lys Gly Arg Lys Phe Tyr Xaa Pro Glu Gly Pro 
100 105 110 

Tyr Gly Val Phe Ala Gly Arg Asp Ala Ser Arg Gly Leu Ala Thr Phe 
115 120 125 

Cys Leu Asp Lys Glu Ala Leu Lys Asp Glu Tyr Asp Asp Leu Ser Asp 
130 135 140 

Leu Thr Ala Ala Gin Gin Glu Thr Leu Ser Asp Trp Glu Ser Gin Phe 
145 150 155 160 

Thr Phe Lys Tyr His His Val Gly Lys Leu Leu Lys Glu Gly Glu Glu 
165 170 175 

Pro Thr Val Tyr Ser Asp Glu Glu Glu Pro Lys Asp Glu Ser Ala Arg 
180 185 190 

Lys Asn Asp 
195 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AACCGTTGCT GGGTGTCCCA GGGCCTGAGG CAGGACGGTA CTCCGCTGAC ACCTTCCCTT 6 0 
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TCGGCCTTGA GGTTCCCAGC CTGGTGGCCC CAGGACGTTC CGGTCGCATG GCAGAGTGCT 12 0 

ACGGACGACG CCTATGAAGC CCTTAGTCCT TCTAGTTGCG CTTTTGCTAT GGCCTTCGTC 180 

TGTGCCGGCT TATCCGAGCA TAACTGTGAC ACCTGATGAA GAGCAAAACT TGAATCATTA 24 0 

TATACAAGTT TTAGAGAACC TAGTACGAAG TGTTCCCTCT GGGGAGCCAG GTCGTGAGAA 3 00 

AAAATCTAAC TCTCCAAAAC ATGTTTATTC TATAGCATCA AAGGGATCAA AATTTAAGGA 3 60 

GCTAGTTACA CATGGAGACG CTTCAACTGA GAATGATGTT TTAACCAATC CTATCAGTGA 42 0 

AGAAACTACA ACTTTCCCTA CAGGAGGCTT CACACCGGAA ATAGGAAAGA AAAAACACAC 48 0 

GGAAAGTACC CCATTCTGGT CGATCAAACC AAACAATGTT TCCATTGTTT TGCATGCAGA 540 

GGAACCTTAT ATTGAAAATG AAGAGCCAGA GCCAGAGCCG GAGCCAGCTG CAAAACAAAC 600 

TGAGGCACCA AGAATGTTGC CAGTTGTTAC TGAATCATCT ACAAGTCCAT ATGTTACCTC 6 60 

ATACAAGTCA CCTGTCACCA CTTTAGATAA GAGCACTGGC ATTGAGATCT CTACAGAATC 72 0 

AGAAGATGTT CCTCAGCTCT CAGGTGAAAC TGCGATAGAA AAACCCGAAG AGTTTGGAAA 7 80 

GCACCCAGAG AGTTGGAATA ATGATGACAT TTTGAAAAAA ATTTTAGATA TTAATTCACA 84 0 

AGTGCAACAG GCACTTCTTA G TGACACC AG CAACCCAGCA TATAGAGAAG ATATTGAAGC 90 0 

CTCTAAAGAT CACCTAAAAC GAAGCCTTGY TCTAGCAGCA GCAGCAGAAC ATAAATTAAA 96 0 

AACAATGTAT AAGTCCCAGT TATTGCCAGT AGGACGAACA AGTAATAAAA TTGATGACAT 102 0 

CGAAACTGTT ATTAACATGC TGTGTAATTC T AG ATC T AAA CTCTATGAAT ATTTAGATAT 108 0 

TAAATGTGTT CCACCAGAGA TGAGAGAAAA AGCTGCTACA GTATTCAATA CATTAAAAAA 114 0 

TATGTGTAGA TCAAGGAGAG TCACAGCCTT ATTAAAAGTT TATTAAACAA TAATATAAAA 12 0 0 

ATTTTAAACC TACTTGATAT TCCATAACAA AGCTGATTTA AGCAAACTGC ATTTTTTCAC 126 0 

AGGAGAAATA ATCATATTCG TAATTTCAAA AGTTGTATAA AAATATTTTC TATTGTAGTT 13 2 0 

CAAATGTGCC AACATCTTTA TGTGTCATGT GTTATGAACA ATTTTCATAT GCACTAAAAA 13 8 0 

CCTAATTTAA AATAAAATTT TGGTTCAGGA AAAAAAAAAA AAAAAAAA 142 8 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys Pro Leu Val Leu Leu Val Ala Leu Leu Leu Trp Pro Ser Ser 
15 10 15 

Val Pro Ala Tyr Pro Ser lie Thr Val Thr Pro Asp Glu Glu Gin Asn 
20 25 30 

Leu Asn His Tyr lie Gin Val Leu Glu Asn Leu Val Arg Ser Val Pro 
35 40 45 

Ser Gly Glu Pro Gly Arg Glu Lys Lys Ser Asn Ser Pro Lys His Val 
50 55 60 

Tyr Ser lie Ala Ser Lys Gly Ser Lys Phe Lys Glu Leu Val Thr His 
65 70 75 80 

Gly Asp Ala Ser Thr Glu Asn Asp Val Leu Thx Asn Pro lie 

85 90 95 

Glu Thr Thr Thr Phe Pro Thr Gly Gly Phe Thr Pro Glu He Gly Lys 
100 105 110 

Lys Lys His Thr Glu Ser Thr Pro Phe Trp Ser He Lys Pro Asn Asn 
115 120 125 

Val Ser He Val Leu His Ala Glu Glu Pro Tyr He Glu Asn Glu Glu 
130 135 140 

Pro Glu Pro Glu Pro Glu Pro Ala Ala Lys Gin Thr Glu Ala Pro Arg 
145 150 155 160 

Met Leu Pro Val Val Thr Glu Ser Ser Thr Ser Pro Tyr Val Thr Ser 
165 170 175 

Tyr Lys Ser Pro Val Thr Thr Leu Asp Lys Ser Thr Gly He Glu He 
180 185 190 

Ser Thr Glu Ser Glu Asp Val Pro Gin Leu Ser Gly Glu Thr Ala He 
195 200 205 

Glu Lys Pro Glu Glu Phe Gly Lys His Pro Glu Ser Trp Asn Asn Asp 
210 215 220 

Asp He Leu Lys Lys He Leu Asp He Asn Ser Gin Val Gin Gin Ala 
225 230 235 240 

Leu Leu Ser Asp Thr Ser Asn Pro Ala Tyr Arg Glu Asp He Glu Ala 
245 250 255 

Ser Lys Asp His Leu Lys Arg Ser Leu Xaa Leu Ala Ala Ala Ala Glu 
260 265 270 
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His Lys Leu Lys Thr Met Tyr Lys Ser Gin Leu Leu Pro Val Gly Arg 
275 280 285 

Thr Ser Asn Lys lie Asp Asp He Glu Thr Val lie Asn Met Leu Cys 
290 295 300 

Asn Ser Arg Ser Lys Leu Tyr Glu Tyr Leu Asp He Lys Cys Val Pro 
305 310 315 320 

Pro Glu Met Arg Glu Lys Ala Ala Thr Val Phe Asn Thr Leu Lys Asn 
325 330 335 

Met Cys Arg Ser Arg Arg Val Thr Ala Leu Leu Lys Val Tyr 
340 345 350 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3742 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GATCCCCACC ACACCACCAG CCCGGCCGCA CGGGGCACTG AGCCGGGTGC TGAGCACCGG 6 0 

AGGCCCCGCC GAGGCCGGGA CTCAGGACCT GCAGAGAAAC GCCTCCTGAT TTTGTCTTAC 12 0 

AATGGAACTT AAAAAGTCGC CTGACGGTGG ATGGGGCTGG GTGATTGTGT TTGTCTCCTT 180 

CCTTACTCAG TTTTTGTGTT ACGGATCCCC AC TAGCTGTT GGAGTCCTGT ACATAGAATG 240 

GCTGGATGCC TTTGGTGAAG GAAAAGGAAA AACAGCCTGG GTTGGATCCC TGGCAAGTGG 3 00 

AGTTGGCTTG CTTGCAAGTC CTGTCTGCAG TCTCTGTGTC TCATCTTTTG GAGCAAGACC 3 60 

TGTCACAATC TTCAGTGGCT TCATGGTGGC TGGAGGCCTG ATGTTGAGCA GTTTTGCTCC 42 0 

CAATATCTAC TTTCTGTTTT TTTCCTATGG CATTGTTGTA GGTGCTCCAA ATATTGCTGT 480 

TTGGAGAAAT CTGCACCTTA ATAAGAGCAG TTATTTGTGA GAAAAAAAAA AGACAAGAAT 54 0 

ATATATGAGA TGGGTGTGAC GAAGATCCAG TCTCTCTACA TGAAGACCAG ACTGATTGCT 600 

CCAGTCTCAG AGATGAAAAC AATAAAGAGA ACTACCCCGA CGCAGGGGCT CTGGTAGAAG 660 

AGACGCGCCG CCCTCTTGGG AGCCGCAGCA GCAGAATGTA GAGGCGACCG TGCTGGTGGA 72 0 

CAGCGTATTG CGACCCAGCA TGGGCAACTT CAAGTCCAGG AAGCCCAAGT CCATCTTCAA 780 
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AGCGGAGAGC GGGAGGAGCC ACGGAGAAAG TCAGGAGACA GAGCATGTGG TATCCAGCCA 840 
GTCAGAGTGT CAGGTGAGAG CAGGAACACC AGCTCATGAG AGTCCACAAA ACAATGCCTT 900 
CAAGTGCCAA GAAACAGTGC GACTTCAACC AAGAATAGAC CAGAGGACTG CCATTTCGCC 9 60 

AAAGGATGCT TTTGAAACTC GGCAGGACTT AAATGAGGAA GAAGCTGCTC AGGTGCATGG 102 0 

AGTCAAGGAC CCGGCGCCAG CATCAACCCA GAGCGTGCTT GCCGATGGGA CAGATTCTGC 1080 

AGACCCCTCA CCAGTCCACA AAGATGGGCA GAATGAGGCC GACAGTGCAC CAGAAGACCT 1140 

CCACTCTGTG GGGACCAGCA GGCTGCTCTA TCACATCACT GATGGTGATA ACCCACTGCT 120 0 

GTCGCCACGA TGCTCCATCT TCAGCCAAAG C C AG AG ATTC AACTTAGACC CCGAGTCAGC 1260 

CCCATCTCCA CCCAGCACTC AGCAGTTTAT GATGCCGCGG AGTTCTTCAC GCTGCAGCTG 132 0 

TGGAGATGGC AAGGAGCCAC AGACCATCAC CCAGCTCACC AAGCACATCC AGAGCCTCAA 13 8 0 

GCGGAAAATT CGGAAATTTG AAGAAAAATT TGAACAAGAA AAGAAATACC GGCCTTCACA 1440 

TGGTGACAAG ACTTCTAATC CTGAAGTCCT GAAATGGATG AATGATTTGG CTAAAGGTCG 1500 

TAAACAGCTC AAAGAACTAA AGCTAAAGCT GTCAGAAGAA CAAGGGAGTG CTCCCAAAGG 1560 

TCCACCTAGA AACCTGTTGT GTGAGCAACC CACAGTCCCC AGAGAAAATG GGAAACCGGA 162 0 

AGCTGCGGGC CCGGAGCCAA GCTCCTCTGG AGAAGAGACT CCAGATGCTG CCTTGACATG 1680 

CCTGAAGGAG AGAAGAGAGC AACTTCCTCC CCAGGAGGAT TCTAAGGTAA CTAAGCAAGA 174 0 

CAAGAACCTC ATAAAGCCGC TTTATGACCG ATACAGAATT ATCAAGCAAA TCTTGTCAAC 1800 

ACCTTCCCTT ATTCCAACAA TTCAGGAGGA AGAGGACTCT GATGAAGACC GTCCACAGGG 186 0 

AAGCCAACAA CCTTCTTTGG CAGATCCAGC ATCTCACCTT CCTGTTGGTG ACCACCTCAC 192 0 

CTACTCTAAT GAGACTGAGC CTGTTAGGGC CCTTTTACCA GATGAAAAGA AAGAAGTAAA 19 8 0 

ACCACCAGCT CTCTCCATGT CTAATTTACA TGAGGCTACC ATGCCTGTAC TTCTTGACCA 2 04 0 

TCTCCGAGAA ACTAGGGCTG ACAAGAAGAG ACTGCGGAAA GCCTTAAGAG AATTTGAAGA 2100 

ACAGTTTTTT AAACAAACAG GAAGAAGTCC ACAAAAGGAA GATAGGATAC CAATGGCAGA 216 0 

TGAGTATTAT GAATATAAGC AC AT AAAAG C CAAACTGAGA CTATTAGAGG TCCTCATCAG 222 0 

CAAGCAAGAT GTGGCCAAAA CTATTTGAGG TTCAGGAAAT GTTATGATCA CTTTCACCCA 22 8 0 

TGATATAAAG TAAAGTTTAT TTTCCTCTGC CATCCTTGCT AAGTAGTTTT GACACAATGA 23 4 0 

AAATGGAAGC ACTTTAGTGG TAGTATTAGC TGTTTTTAAG AAGGAATAGC AAGTTTAATT 2400 

ATATACAAGG AGAAGGGATT TAAACGGGGG GAAGAATACA ACAGGTAGCC ATATAATTGG 2460 
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GAAAAAATTC AGTGTCCTCC ATGCCAAGCA GAAAACTCAT AG TC AAT AC A AGTATTTTTA 
AAAATGTCTA ATATTTTATC AAATCTAAAT AACATAGCTA GGACACTTGT TAGGGAAAGT 
TTATTTAGTA TCCAAAGACT GTTTATGTTG ATGTATGGAA AAGAGCATGA TTTTAAAAAA 
TCAATCATAG GAGGAAAAGA AATTCGCTTT TCAAGTAGGA AGGAATACAG CTAGCAAGAA 
AGCAATTTAT TTGAAACTTC TAATGGATTT TTGAGTGATA AAACATTTAC TACCTTGTCC 
TTTAAGTCTG CTAGGCTCTC AGTACCCTAA AATAAACTAG ATTGTGTTGC TATTTTTTTT 
CTTTCTCTAT AAAAATAACA CATTATTTTA TCCGTTATTT GAAATTTTAC ATTTCTGGTT 
ACCAAAGTTC ATTCTGATAG CATGTACTTT GTGAATTATT ATCTTTGTCT ATAACTGACA 
GATGTTTATA TTAAAATAAA ATATTGTATT AAAAATTTAA AATAGGTATT TTGGATAGAT 
ATGTGTCTGT AGTATATAAT CTAATGTGTC CATAGTATTA TTGCTAATCT TTTGGTTTAC 
TATAAGATGA TATAACTATT TTTTCATTGG GAATATACAT TTTTCTTAAT GTTCCAACAT 
CTATACTTTG TAAAGTCAAA ACATTTCCCA TGAGCTGTAG TTATTCATCC TTCTGTACAA 
AATGAAAAGT TTGGAAATTG TTTGCCCTGA TACCTTGAAA AAGAAGCCAG AATATTTATT 
TGCTTCATCA ACTTCAGTGT ATATCATTTT GTGTTATTTT ATACGAAAAC ATG TTTATTA 
TTTTCATTTT TGTAAAAGGA AGTAAAAGGT CAACATTTTC TCTCATGTAC CAACCTTGTT 
TGTATTTCTA TTTTTCTGTA ATGTTTAAGT ATGGATGTTG GAAGAAATTC AACATTCTCT 
TATAGTTTGG ATGGGAAGAC TATTGACTAT TTCAGAAACA GACTTATTTC AGAGGCTTAT 
TGTTTTCTCT GTATTTACCT AATATTTTAT AAC TTTTATG AATCAGAATA ATGTCCTTCA 
TAAATTTGTT TAATTGAAGT CATCTACTTY TAACAGGACA GATACACAAC TATTTGAGGT 
TTACAAATTA CATCTTTGAT AAGGGAAATG GTTTCGTGAC ATGTACACAG TTGCTATTAA 
AATGTAACTC TATATATTCT ATATGATTGT AAATATTTTA TACAACAATA CAAATAAAAT 
ATTTT TCTAT TAAAAAAAAA AA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3742 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Gly Asn Phe Lys Ser Arg Lys Pro Lys Ser lie Phe Lys Ala Glu 
15 10 15 

Ser Gly Arg Ser His Gly Glu Ser Gin Glu Thr Glu His Val Val Ser 
20 25 30 

Ser Gin Ser Glu Cys Gin Val Arg Ala Gly Thr Pro Ala His Glu Ser 
35 40 45 

Pro Gin Asn Asn Ala Phe Lys Cys Gin Glu Thr Val Arg Leu Gin Pro 
50 55 60 

Arg lie Asp Gin Arg Thr Ala lie Ser Pro Lys Asp Ala Phe Glu Thr 
65 70 75 80 

Arg Gin Asp Leu Asn Glu Glu Glu Ala Ala Gin Val His Gly Val Lys 
85 90 95 

Asp Pro Ala Pro Ala Ser Thr Gin Ser Val Leu Ala Asp Gly Thr Asp 
100 105 110 

Ser Ala Asp Pro Ser Pro Val His Lys Asp Gly Gin Asn Glu Ala Asp 
115 120 125 

Ser Ala Pro Glu Asp Leu His Ser Val Gly Thr Ser Arg Leu Leu Tyr 
130 135 140 

His lie Thr Asp Gly Asp Asn Pro Leu Leu Ser Pro Arg Cys Ser lie 
145 150 155 160 

Phe Ser Gin Ser Gin Arg Phe Asn Leu Asp Pro Glu Ser Ala Pro Ser 
165 170 175 

Pro Pro Ser Thr Gin Gin Phe Met Met Pro Arg Ser Ser Ser Arg Cys 
180 185 190 

Ser Cys Gly Asp Gly Lys Glu Pro Gin Thr lie Thr Gin Leu Thr Lys 
195 200 205 

His lie Gin Ser Leu Lys Arg Lys lie Arg Lys Phe Glu Glu Lys Phe 
210 215 220 

Glu Gin Glu Lys Lys Tyr Arg Pro Ser His Gly Asp Lys Thr Ser Asn 
225 230 235 240 

Pro Glu Val Leu Lys Trp Met Asn Asp Leu Ala Lys Gly Arg Lys Gin 
245 250 255 

Leu Lys Glu Leu Lys Leu Lys Leu Ser Glu Glu Gin Gly Ser Ala Pro 
260 265 270 

Lys Gly Pro Pro Arg Asn Leu Leu Cys Glu Gin Pro Thr Val Pro Arg 
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275 280 285 

Glu Asn Gly Lys Pro Glu Ala Ala Gly Pro Glu Pro Ser Ser Ser Gly 
290 295 300 

Glu Glu Thr Pro Asp Ala Ala Leu Thr Cys Leu Lys Glu Arg Arg Glu 
305 310 315 320 

Gin Leu Pro Pro Gin Glu Asp Ser Lys Val Thr Lys Gin Asp Lys Asn 
325 330 335 

Leu lie Lys Pro Leu Tyr Asp Arg Tyr Arg lie lie Lys Gin lie Leu 
340 345 350 

Ser Thr Pro Ser Leu lie Pro Thr lie Gin Glu Glu Glu Asp Ser Asp 
355 360 365 

Glu Asp Arg Pro Gin Gly Ser Gin Gin Pro Ser Leu Ala Asp Pro Ala 
370 375 380 

Ser His Leu Pro Val Gly Asp His Leu Thr Tyr Ser Asn Glu Thr Glu 
385 390 395 400 

Pro Val Arg Ala Leu Leu Pro Asp Glu Lys Lys Glu Val Lys Pro Pro 
405 410 415 

Ala Leu Ser Met Ser Asn Leu His Glu Ala Thr Met Pro Val Leu Leu 
420 425 430 

Asp His Leu Arg Glu Thr Arg Ala Asp Lys Lys Arg Leu Arg Lys Ala 
435 440 445 

Leu Arg Glu Phe Glu Glu Gin Phe Phe Lys Gin Thr Gly Arg Ser Pro 
450 455 460 

Gin Lys Glu Asp Arg lie Pro Met Ala Asp Glu Tyr Tyr Glu Tyr Lys 
465 470 475 480 

His lie Lys Ala Lys Leu Arg Leu Leu Glu Val Leu lie Ser Lys Gin 
485 490 495 

Asp Val Ala Lys Thr lie 
500 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 

AATCGTCGGG AAGTGTTTTT GAGAAGTCTC GGTCGGTAAG GGAAGTCTTC CAAGTCCGTG 6 0 

CAGCACTAAC GTATTGGCAC CTGCCTCCTC TTCGGCCACC CCCCAGATGA GGCAGCTGTG 12 0 

ACTGTGTCAA GGGAAGCCAC GACTCTGACC ATAGTCTTCT CTCAGCTTCC ACTGCCGTCT 180 

CCACAGGTGG GCTTCACTTT CGTGGAATCC TTGGGCTGCC GAGTTACACC TTAGGAATCC 240 

TCTAATTTTC TTTCCACCTT TTGCACGCAC GCCAGGAGAT TTCTTTTCTT CATCTGTCCA 3 00 

GTGAGGTTAC CGTTTTTACT TCACAGGATT GTTGTGAAGA CCGAATTGCC AAGTGCAGTT 3 60 

CCTGGCGCGG AGTAGGCAGG TCTTATAAAT ATTGGTTCAG TCTGAAGTTT ATCCTGGTTG 420 

TTTCCCTTCT GATAATTTTT TAAGCACTTT TTATTTGCTG GGTGTTTTCA CATACTTGAT 480 

GGCCATCTGA CAGATGAGCA AGGAGGCTCA GAAGCTCAGC TTAAGATTTA AAAAAAAGCA 540 

GGGGGGCTAG AATTTAAATC AAGGTCTATC TGATGTCTAA GCTACCTATT CTGTTATACT 600 

GCATAATACC CTTTTTATAT TATTTTTTAT ATTTAATCAG TAACATATGT AGATAGTACA 660 

AAATTCAACA GATATCAAAG TGTGTTAAGT TTACCTTTCC ACCCACTTTC TCATTTTTGT 72 0 

CTCCCCCAGT TCCTTTTGCA TTATTCCACG TATATTCTGT GCATATATAC ATTCATATAC 7 80 

ATTTATCTGT ATGTGTCAGC TTCTTTTTAC ACAAATGATA CATAAACACT GTTCTGGACC 840 

TTCCAACTTA GAATTACTGC AAACAGTGTC GTGATGAATT ACCTAATTCT GTGTATGTGT 900 

GTATATTGGT AGAAAAAATT CCCGGAAGTA GAATTGCTAG AACAAAGATT TATGCATTTT 960 

AAATATTCCT TTATTATAAA ACTAATGAAA GTAAACATGT TGGCTATGAC CACGTATGCT 102 0 

CTATGCTCAG TTTTTCTAGA GTTGTGTATG CTTAATATAG GAGTAAGATT CTTTTAAAAT 10 8 0 

GGTATATTCA TTGCCTTATT TGATTTTCAT AGTCAATCGT TTTAATTTTT CAGTCTACAT 114 0 

ATATAGGTGT TTGGAAAGGA TATAAATATC TTCTGCTGCA TGTACCTACA GTGATAAACT 1200 

CTCTCCTCCT ACATACCTTT GAGATTTTTT TTTTTTTTTT GAGACAGAGT CTCTCTCTGT 12 60 

CACTCAGGCT GGAGTGCAGT GGCACAGTCT GGGCTCACTG CATCCTCTGC CTACCGGGTT 13 2 0 

CAAGCAGTTC TCCTGCCTCA GCCTCTCGAG TAGCTGGGAT TACAGGCACC TGTCACCACG 13 80 

CCTGGCTAAT TTTTGTATTT TTAGTTGAGA CGGGGTTTCA CCATGTTGGG CAGGCTAGTC 144 0 

TCGAACTCCT GACCTCAAGT GATCCGCCTG CCTTGGCCTC CCACAGTGTT GGGATTACAG 150 0 

GTGTGAGCCA CCGTGCCTGG CCTACCTTTG AGATTTGTGA TGAGGAAACA AGAGATGAAT 15 6 0 
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TGTATGAGAG CACTTCAAAA GATTCATGGA AAATACTTAT TTCAAAAAGA GTAGTTAATA 162 0 

TTACCTTATT TTTCTTATCT GCTAACCCCT TTCTTTCAAA TGCACTTAGG ACTTGCTGCT 168 0 

AAAACTCACT GCAAGTAAGA TACCACAAGG AGGCAGCATA GAACTGATTT TCTATACATG 174 0 

CTCAGGACAG TAGTTTCACT CATAGATGAA AAGTTAGAAT TTGGATTTAT TTGAAATATA 180 0 

TACAAATATT CAAGTATATA CATATATTCA AATAAATACA TATATGTATA TATGTGTGTA 186 0 

TATACACACA TACATACACA TGAATCATCA TTGCCTTCTT GAGATCTCAC CACTTTAGTC 192 0 

CTACTAAGAT GGGTGGTTGT TGGTTTTTTT TTGTTGTTGT TGTTGTTTTT TAAATTCCAA 19 80 

TCTGTATGGA ATGATACTTT AATAAAATTA TGTGCTCGGA TGTTGAATAA ATGTCAAATT 2040 

GCCATAAAAA AAAAAAAAAA A 2061 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met His Phe Lys Tyr Ser Phe lie He Lys Leu Met Lys Val Asn Met 
15 10 15 

Leu Ala Met Thr Thr Tyr Ala Leu Cys Ser Val Phe Leu Glu Leu Cys 
20 25 30 

Met Leu Asn He Gly Val Arg Phe Phe 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1772 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGGAAGCGGG TCCCGCAGGT CGCCACGGTT GGGGGAAACG CGGCGGACGC CGCCCCCGTC 60 

CCGAAGGGGA CTCGAAAATG TACAGCCAGC GGTTTGGCAC CGTACAGCGG GAGGTTAAGG 12 0 

GCCCCACCCC CAAAGTGGTG ATCGTGAGAT CCAAGCCTCC TAAAGGCCAA GGAGCTGAGC 180 

ACCATCTAGA AAGAATCCGA CGCAGCCATC AGAAGCATAA TGCTATTTTG GCTTCCATTA 24 0 

AGTCAAGTGA GCGGGATCGC TTGAAAGCTG AGTGGGACCA GCACAATGAC TGCAAGATTT 3 00 

TGGACAGCCT TGTGCGAGCA AGAATCAAGG ATGCTGTGCA AGGGTTTATC ATTAACATTG 3 60 

AAGAAAGACG AAATAAGCTA CGTGAGCTTT TAGCATTAGA AGAAAATGAG TATTTTACAG 42 0 

AAATGCAATT GAAGAAAGAA ACCATTGAGG AGAAAAAAGA TAGGATGAGA GAGAAAACTA 480 

AATTACTAAA AGAGAAGAAT GAAAAAGAGA GGCAGGATTT TGTGGCTGAA AAGCTAGACC 540 

AGCAATTCAG GGAACGCTGT GAGGAGCTCC GTGTTGAATT GTTATCTATC CATCAGAAGA 600 

AGGTGTGTGA GGAGCGGAAA GCACAGATTG CATTTAATGA GGAGCTGAGC AGGCAAAAGC 660 

TGGTGGAAGA GCAGATGTTC TCCAAACTCT GGGAGGAAGA CCGATTAGCC AAGGAAAAGC 72 0 

GAGAAGCCCA AGAGGCGAGG AGACAGAAAG AGCTGATGGA GAACACACGC CTGGGGCTGA 7 80 

ATGCCCAGAT CACCAGCATC AAGGCACAAA GGCAGGCGAC ACAGCTGCTG AAGGAAGAGG 84 0 

AGGCACGCCT TGTGGAAAGT AACAACGCAC AGATTAAACA TGAGAATGAA CAGGATATGC 90 0 

TAAAGAAACA GAAGGCAAAG CAGGAAACTA GGACCATTTT GCAAAAAGCC C T AC AAG AG A 9 60 

GGATAGAACA TATTCAGCAG GAATACAGAG ACGAACAGGA CTTGAACATG AAGCTCGTGC 102 0 

AAAGGGCCCT TCAAGACTTA CAGGAAGAGG CAGATAAAAA GAAACAAAAA AGAGAAGATA 108 0 

TGATAAGAGA ACAGAAGATA TACCATAAAT ATTTGGCACA GAGACGTGAG GAAGAAAAAG 1140 

CTCAGGAGAA AGAATTTGAC AGAATATTAG AGGAAGACAA GGCAAAGAAG TTGGCTGAGA 120 0 

AGGACAAGGA GCTGAGACTT GAAAAGGAGG CAAGGAGACA GCTTGTGGAT GAGGTCATGT 12 6 0 

GTACAAGAAA ACTTCAAGTT CAAGAAAAGT TGCAACGAGA AGCTAAAGAA CAGGAAGAAC 13 2 0 

GTGCTATGGA ACAGAAACAC ATAAATGAAA GTCTTAAAGA ACTTAACTGT GAAGAGAAGG 13 80 

AGAATTTTGC AAGACGCCAA CGTTTAGCCC AGGAGTACAG GAAGCAACTT CAGATGCAAA 144 0 

TCGCCTACCA GCAGCAGTCC CAAGAAGCAG AGAAGGAAGA GAAACGCCGA GAGTTTGAAG 1500 

CAGGTGTAGC AGCAAACAAG ATGTGTTTGG ACAAGGTCCA GGAGGTCCTG TCCACCCATC 156 0 

AAGTGCTGCC TCAAAACATT CATCCCATGC GCAAGGCATG CCCCAGTAAG CTTCCACCGT 162 0 
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AGTTCCGTGA GCATCAATAT ATCTTTTCTT GGTCTTTTAA TATTTTTAAC TACAGTATGC 16 8 0 

TTGTATGCTT CTTTTAACTC CTGGATAAAC TTTTCTTTTT TCCCTGAAAA AAAAAAAAAA 17 40 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 1772 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Tyr Ser Gin Arg Phe Gly Thr Val Gin Arg Glu Val Lys Gly Pro 
15 10 15 

Thr Pro Lys Val Val lie Val Arg Ser Lys Pro Pro Lys Gly Gin Gly 
20 25 30 

Ala Glu His His Leu Glu Arg lie Arg Arg Ser His Gin Lys His Asn 
35 40 45 

Ala lie Leu Ala Ser lie Lys Ser Ser Glu Arg Asp Arg Leu Lys Ala 
50 55 60 

Glu Trp Asp Gin His Asn Asp Cys Lys lie Leu Asp Ser Leu Val Arg 
65 70 75 80 

Ala Arg lie Lys Asp Ala Val Gin Gly Phe lie He Asn lie Glu Glu 
85 90 95 

Arg Arg Asn Lys Leu Arg Glu Leu Leu Ala Leu Glu Glu Asn Glu Tyr 
100 105 110 

Phe Thr Glu Met Gin Leu Lys Lys Glu Thr lie Glu Glu Lys Lys Asp 
115 120 125 

Arg Met Arg Glu Lys Thr Lys Leu Leu Lys Glu Lys Asn Glu Lys Glu 
130 135 140 

Arg Gin Asp Phe Val Ala Glu Lys Leu Asp Gin Gin Phe Arg Glu Arg 
145 150 155 160 

Cys Glu Glu Leu Arg Val Glu Leu Leu Ser lie His Gin Lys Lys Val 
165 170 175 

Cys Glu Glu Arg Lys Ala Gin lie Ala Phe Asn Glu Glu Leu Ser Arg 
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180 185 190 

Gin Lys Leu Val Glu Glu Gin Met Phe Ser Lys Leu Trp Glu Glu Asp 



195 



200 205 



Arg Leu Ala Lys Glu Lys Arg Glu Ala Gin Glu Ala Arg Arg Gin Lys 
210 215 220 

Glu Leu Met Glu Asn Thr Arg Leu Gly Leu Asn Ala Gin lie Thr Ser 
225 230 235 240 

He Lys Ala Gin Arg Gin Ala Thr Gin Leu Leu Lys Glu Glu Glu Ala 
245 250 255 

Arg Leu Val Glu Ser Asn Asn Ala Gin He Lys His Glu Asn Glu Gin 
260 265 270 

Asp Met Leu Lys Lys Gin Lys Ala Lys Gin Glu Thr Arg Thr He Leu 
275 280 285 



Gin Lys Ala Leu Gin Glu Arg ±J.e biu a±a xxc ou.^ m.** — - --=, 
290 295 300 

Asp Glu Gin Asp Leu Asn Met Lys Leu Val Gin Arg Ala Leu Gin Asp 
305 310 315 320 

Leu Gin Glu Glu Ala Asp Lys Lys Lys Gin Lys Arg Glu Asp Met He 
325 330 335 

Arg Glu Gin Lys He Tyr His Lys Tyr Leu Ala Gin Arg Arg Glu Glu 
340 345 350 

Glu Lys Ala Gin Glu Lys Glu Phe Asp Arg He Leu Glu Glu Asp Lys 
355 360 365 

Ala Lys Lys Leu Ala Glu Lys Asp Lys Glu Leu Arg Leu Glu Lys Glu 
370 375 380 

Ala Arg Arg Gin Leu Val Asp Glu Val Met Cys Thr Arg Lys Leu Gin 
385 390 395 400 

Val Gin Glu Lys Leu Gin Arg Glu Ala Lys Glu Gin Glu Glu Arg Ala 
405 410 415 

Met Glu Gin Lys His He Asn Glu Ser Leu Lys Glu Leu Asn Cys Glu 
420 425 430 

Glu Lys Glu Asn Phe Ala Arg Arg Gin Arg Leu Ala Gin Glu Tyr Arg 
435 440 445 

Lys Gin Leu Gin Met Gin He Ala Tyr Gin Gin Gin Ser Gin Glu Ala 
450 455 460 

Glu Lys Glu Glu Lys Arg Arg Glu Phe Glu Ala Gly Val Ala Ala Asn 
465 470 475 480 
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Leu Pro Gin Asn lie His Pro Met Arg Lys Ala Cys Pro Ser Lys 
500 505 510 



Leu 



Pro Pro 

<2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AACGGGAAAT GCTCTGTATC CAAATGC CAG CTGAGATCAT TAGCCACGCT GAAATTCGTT 60 

GACTCGTACT GTGCTAAGGT TGCTGCTAGA CGTTGTTAAC AGAGTTAGTG GTCTTTGGTT 12 0 

ATGGTGGTTA GCAGCCCCAC TTATCTGTTC CATACTACCA GTGAATAGTT TAAAATTCAT 180 

GTTGCACCTA TAATTTATCC CACATAAATA ATTCAGGCTA TTTATTTTGG CATTCAATTG 240 

ATTTTTTTCT CACTTTAAAA AAC TGAGGT A TGGAGACTGG TGCTAGCAAC ACGGGATTGG 3 00 

CTAACGCATC CTCTTGCTGT TCCCGGTGTT TGGGCCTTGC CTGTGACAGT GGGAAAAAAA 3 60 

ATGGCCTTGC TGTGCTACAA CCGGAGCTGC GGTCAGCGCT TCGATCCTGA GACCAATTCC 42 0 

GACGATGCTT GCACATATCA CCCAGGCGTT CCAGTCTTTC ACGATGCATT AAAGGGTTGG 4 80 

TCTTGCTGTA AGAGAAGAAC AACTGATTTT TCTGATTTCT TAAGCATTGT AGGCTGTACA 540 

AAAGGTAGAC ATAATAGTGA GAAGCCACCT GAGCCAGTCA AACCTGAGGT CAAGACTACT 600 

GAGAAGAAGG AACTATCTGA ATTAAAACCA AAATTTCAGG AACACATTCA AGCCCCTAAG 66 0 

ACAGTAGACG C G AT AAAAAG ACCAAGCCCA GATGAACCAA TGACAAATTT GGAATTAAAA 72 0 

ATATCTGCYT CCCTTAAAAC AAGCACTTGA TAAACTTAAA CTGTCATCAG GGAATGAAGA 7 80 

AAATAAGAAA AGAAGACAAT GATGAAATTA AGATTGGGAC CTCATGTAAG AATGGAGGGT 840 

GTTCAAAGAC ATATCGGGGT C T AG AG AGTC TAGAAGAAGT CTGTGTATAT CATTCTGGAG 900 

TACCTATTTT CCATGAGGGG AC G AAAT AC T GGAGCTGTTG TAGAAGAAAA ACTTCTGATT 960 
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TTAATACATT CTTAGACCAA GAGGGCTGTA CAAAAGAGAA ACATGTGGAC TAAAAAAGAT 1020 

GCTGGGAAAA AAGTTGTTCC ATGTAGACAT GACTGGCATC AGACTGGAGG TGAAGTTACC 1080 

ATTTCAGTAT ATGCTAAAAA CTCACTTCCA GAACTTAGCC GAGTAGAAGC AAATAGCACA 1140 

TTGTTAAATG TGCATATTGT ATTTGAAGGA GAGAAGGAAT TTGATCAAAA TGTGAAATTA 1200 

TGGGGTGTGA TTGATGTAAA GCGAAGTTAT GTAAC TATG A CTGCAACAAA GATTGAAATC 1260 

ACTATGAGAA AAGCTGAACC GATGCAGTGG GCAAGCCTTG AACTGCCTGC AGCTAAAAAG 1320 

CAGGAAAAAC AAAAAGACGA CACAGCAGAT TGAGTGGGAG ATGGGAGGAA GGCTATTACG 13 80 

TATTTCAGAA TTTTTAATAC TGTGTGAAGT GGTGGCTTGC TGCTGTCATC TTTTGTTTTG 1440 

TTGTTGTGTT ACTGAATGTG GCATTTCAGG GTTAACATTA GGTTCTTAAA AGCCAAAGTC 1500 

AGTTTATCTT TTTGTGCCTC TCATCTTTCT TTCGTGTTAT GTAAGATTGA TTATTCGTTT 1560 

CTCCCTACTG GTAGGAACCA <_ Liuifti-nun ^u^vj^^^^ 

TAACCATAAT TGCAGTATTT CTTTGTATTT CTCTGTTAAG CAAAGAAATA TTAAGGAACT 1680 

TTTTTTATGT CTTTGTATTA TTC C ATAATT AGTAAAGCTA ATTGTGAATG TCCAATTTTA 17 40 

ATGAAATGTC CAATTTTAAT CAGTTTTTTT CATGGATTTG TGTTCTTACG GTACTTGAAA 1800 

ATATTTAAGG AAGAGATGAA GCTCTGCAGT TTTTTCTATG TGGGATGATT GCTTTTTTAA 1860 

GGAGGATTAA TTCTGAGGTA GTATAGTAAG TAAAGGGGAA TATATGAATT GTTAACAAAT 1920 

TAGGATTTGT TTACAACTAC TTGAATTTTT AAATTATGTC AAAACTTACA TTACTTGCCA 1980 

AGCAGTATGA TGTAAGAGTA TAGGAAACAT AAATAAGAAT ACAGAGGTAT CAATTTGATT 2040 

AAAATTCACC ATTTTATAAG ACTAAGCAAT AATCTTAAAA ACCTCTTTCC TGAATATTTA 2100 

AATGTGTTTG TATGGTGTTA TGACTAATTG TTACTGATTT ACAGACTAAG CCCTCTTAAA 2160 

ACCTTTAGTT AAATATAAAA AGAAATTATA TATATYTKGC CTCCCTGATG GAAAACTATG 222 0 

TAAAATTGTA GACTTAAAAG GTTTGTGGAG GCCGGGCGCG GTGGYTCATG CCTGTAATCC 2280 

CAGCACTTTG GGAGGCCGAG GCGGGCAGAT CACGAGGTCA GGAGATSGAG ACCATCCTGG 2340 

CTAACACGGT GAAACCCCGT CTYTACTAGA AATACAAAAA TTAGCTGGGC GTAGTGGTGG 2 400 

GTGCCTGTAG TCCCAGCTAC TCAGGAGGCT GAGGCAGGAG AATGGCGTGA ACCTAAGAGG 2460 

CGGAGCTGGC AGTGAGCTGA GATTGCGCCA CTGCACTCCA GCCTGGGCGA CAGAGCCAGA 2 520 

CTCCGACTCA AAAAAAAAAA AAAAAAAAAA AAAAA 2555 
(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
{ C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Trp Thr Lys Lys Asp Ala Gly Lys Lys Val Val Pro Cys Arg His 
15 10 15 

Asp Trp His Gin Thr Gly Gly Glu Val Thr lie Ser Val Tyr Ala Lys 
20 25 30 

Asn Ser Leu Pro Glu Leu Ser Arg Val Glu Ala Asn Ser Thr Leu Leu 
35 40 45 

Asn Val His lie Val Phe Glu Gly Glu Lys Glu Phe Asp Gin Asn Val 
50 55 60 

Lys Leu Trp Gly Val lie Asp Val Lys Arg Ser Tyr Val Thr Met Thr 
65 70 75 80 

Ala Thr Lys lie Glu lie Thr Met Arg Lys Ala Glu Pro Met Gin Trp 
85 90 95 

Ala Ser Leu Glu Leu Pro Ala Ala Lys Lys Gin Glu Lys Gin Lys Asp 
100 105 HO 

Asp Thr Ala Asp 
115 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 07 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 
AGTGTGGACG GCCCACTGGG TTGGTGGTGG TGGGTGCCCG TC AC AGGGC T GGAGGTGTGG 60 
CCGGCCCACT GGGTTGTGCT TTCTGCCGTA CGTCCCTTCC CATGAGGATG AGATGACCCA 12 0 
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TCTGTTGCAT 


CCCGGCTGCT 


GATAAAACAA 


GACCCTCGGA 


GCCAAGAAAC 


AACACTGAGT 


180 


TCCAGATTTC 


GGAAGGTTCA 


CGAGTGTTGC 


CGACACGCCC 


TCCCAACTGC 


AGACATCCTC 


240 


CCTGGAGGAC 


CTGCTGTGCT 


CACATGCCCC 


CCTGTCCAGC 


GAGGACGACA 


CCTCCCCGGG 


300 


CTGTGCAGCC 


CCCTCCCAGG 


CACCCTTCAA 


GGCCTTCCTC 


AGTCCCCCAG 


AGCCACATAG 


360 


CCACCGAGGC 


ACCGACAGGA 


AGCTGTCCCC 


GCTCCTGAGC 


CCCTTGCAAG 


ACTCACTGGT 


420 


GGACAAGACC 


CTGCTGGAGC 


CCAGGGAGAT 


GGTCCGGCCT 


AAGAAGGTGT 


GTTTCTCGGA 


480 


GAGCAGCCTG 


CCCACCGGGG 


ACAGGACCAG 


GAGGAGCTAC 


TACCTCAATG 


AGATCCAGAG 


540 


CTTCGCGGGC 


GCCGAGAAGG 


ACGCGCGCGT 


GGTGGGCGAG 


ATCGCCTTCC 


AGCTGGACCG 


600 


CCGCATCCTG 


GCCTACGTGT 


TCCCGGGCGT 


GACGCGGCTC 


TACGGCTTCA 


CGGTGGCCAA 


660 


CATCCCCGAG 


AAGATCGAGC 


AGACCTCCAC 


CAAGTCTCTG 


GACGGCTCCG 


TGGACGAGAG 


720 


GAAGCTGCGC 


CiAGCTGACGC 


AGCGCTACCT 


GGCCCTGAGC 


GCGCGCCTGG 


A g AAGC TGGG 


7R0 


CTACAGCCGC 


GACGTGCACC 


CGGCGTTCAG 


CGAGTTCCTC 


ATCAACACCT 


ACGGAATCCT 


840 


GAAGCAGCGG 


CCCGACCTGC 


GCGCCAACCC 


CCTGCACAGC 


AGCCCGGCCG 


CGCTGCGCAA 


900 


GCTGGTCATC 


GACGTGGTGC 


CCCCCAAGTT 


CCTGGGCGAC 


TCGCTGCTGC 


TGCTCAACTG 


960 


CCTGTGCGAG 


CTCTCCAAGG 


AGGACGGCAA 


GCCCCTCTTC 


GCCTGGTGAG 


CCGCCCCGCG 


1020 


CCCGCCGCCT 


TGCCTGCAGT 


AAACGCGTTT 


GTTCCAACCC 


GGGGCCGCGG 


TGCCTCCTGC 


1080 


GCGTCCCCCC 


GGAGGGGAAA 


GGGCCGCGTC 


CCCCGCGCGC 


GAGGCCAGAG 


AAGGCCCCGC 


1140 


TCCCACCGGT 


GCTGGGCCCC 


GACCGCAGCC 


CGCCGCTGCC 


CGCACCTGCG 


GAGTGCTTCT 


1200 


CACCCCTCAT 


TAAAATCATC 


CGTTTGCAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


1260 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA AAAAAAA 




1307 


(2) INFORMATION FOR SEQ ID NO: 18: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
{ C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Met Val Arg Pro Lys Lys Val Cys Phe Ser Glu Ser Ser Leu Pro Thr 
15 10 15 

Gly Asp Arg Thr Arg Arg Ser Tyr Tyr Leu Asn Glu He Gin Ser Phe 
20 25 30 

Ala Gly Ala Glu Lys Asp Ala Arg Val Val Gly Glu He Ala Phe Gin 
35 40 45 

Leu Asp Arg Arg He Leu Ala Tyr Val Phe Pro Gly Val Thr Arg Leu 
50 55 60 

Tyr Gly Phe Thr Val Ala Asn He Pro Glu Lys He Glu Gin Thr Ser 
65 70 75 80 

Thr Lys Ser Leu Asp Gly Ser Val Asp Glu Arg Lys Leu Arg Glu Leu 
85 90 95 

Thr Gin Arg Tyr Leu Ala Leu Ser Ala Arg Leu Glu Lys Leu Gly Tyr 
100 105 110 

Ser Arg Asp Val His Pro Ala Phe Ser Glu Phe Leu He Asn Thr Tyr 
115 120 125 

Gly He Leu Lys Gin Arg Pro Asp Leu Arg Ala Asn Pro Leu His Ser 
130 135 140 

Ser Pro Ala Ala Leu Arg Lys Leu Val He Asp Val Val Pro Pro Lys 
145 150 155 160 

Phe Leu Gly Asp Ser Leu Leu Leu Leu Asn Cys Leu Cys Glu Leu Ser 
165 170 175 

Lys Glu Asp Gly Lys Pro Leu Phe Ala Trp 
180 185 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGGAAGCTG GAGGAAAACG AAGAGAAGAA GCAGTACCGG GAATCTTACA TCAGTGACAA 60 
CCTGGACCTC G AC ATGG AC C AGCTGGAAAA ACGGTCGCGG GCCAGCGGGA GCAGTGCGGG 12 0 
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CAGCATGAAA CACAAGCGCC TGTCCCGTCA TTCCACCGCC AGCCACAGCA GTTCCCACAC 180 

CTCGGGCATT GAGGCAGACA CCAAGCCCCG GGACACGGGG CCGGAAGACA GCTACTCCAG 2 40 

CAGTGCCATC CACCGCAAGC TGAAAACCTG CAGCTCAATG ACCAGTCATG GCAGCTCCCA 3 00 

CACCTCAGGG GTGGAGAGTG GCGGCAAAGA CCGGCTGGAA GAGGACTTAC AGGACGATGA 3 60 

AATAGAGATG TTGGTTGATG ACCCCCGGGA TCTGGAGCAG ATGAATGAAG AGTCTCTGGA 420 

AGTCAGCCCA GACATGTGCA TCTACATCAC AGAGGACATG CTCATGTCGC GGAAGCTGAA 4 80 

TGGACACTCT GGGTTGATTG TGAAAGAAAT TGGGTCTTCC ACCTCGAGCT CTTCAGAAAC 540 

AGTTGTTAAG CTTCGTGGCC AGAGTACTGA TTCTCTTCCA CAGACTATAT GTCGGAAACC 600 

AAAGACCTCC ACTGATCGAC ACAGCTTGAG CCTCGATGAC ATCAGACTTT ACCAGAAAGA 660 

CTTCCTGCGC ATTGCAGGTC TGTGTCAGGA CACTGCTCAG AGTTACACCT TTGGATGTGG 72 0 



7Rn 



CATCCAAGAT GCTTTTCCAG TCAAAAGAAC CAGCAAATAC TTTTCTCTGG ATCTCACTCA 840 
TGATGAAGTT CCAGAGTTTG TTGTGTAAAG TCCGTCTGTG TGCAGCTGTA CAGGCAGCTT 900 
ACTGTTTGCT AGAGGATGCG AAAGTCATAA GTTCTTTACA TATTACTTGT GCCATATCTT 960 

CTTCACCCTA AACATAGCTC TTTCTTTATA ATATTTGTGA TGATGGAAAC AAAAGCCTTG 102 0 

GAACAATTGC ACTTTAAGTA TTACACAGAA GTAAAAGAAC TACAGAAAAT GTACAGCAAG 108 0 

ACAAGTGCCC GGAAGTTCAC TGATCCTTCA GAAGGAAATG CGCTTTACTG ATTGCAAAGC 114 0 

CTTCAGAATA TTGGAGTGTG GTGTGTTTGC TCATCTGATG CTTTTTAGTT CAGTTACATG 12 0 0 

TAACATCACA TTTTTTTTAT CACGTGAAAG ATGTTAGATT TGTTTGCTTA TAAATTTTTT 12 6 0 

ACCACTCCCA CATAAAATGC TCATAGTTTG GGAGAGGAAA GAGGGAAGAT TCTCTCTTCT 132 0 

TTTAACAGAG AGATGATTGC TCTGTATACC CATTGCTTCC TCCCTGAGGC TGTCCCAAAG 13 8 0 

TGAACACTGA TGGAGTGGTC AAAATCATAA GATTGTAGCA AGCCAAAGAT ACGTATGTGA 144 0 

CGGAAGCACA TAAGCAATAA GCAGAAAACC AGAAGTGCAT GCTGTGATGC CTGTGACTCC 15 0 0 

TTCATCCCGC TCAGTGCCAT GTCCTCTTTT GTGATCTTCC AGAAAGCTCC AGGATTCATT 15 6 0 

TGAGTTCCAC ATCCAAGTAA CAGATGAATT ATATTCATGT TGTAATGCAT TTTGTGGAGT 162 0 

TTACAAAACC AGTGTCTGTT AAAACTTTGG AAAATGTCTT AGAAAACGTT GGTGCTTGGT 16 8 0 

GATGCTTTAT TTGTTTAATT ATCAAGAACA AATTATGGCA ATGCTAGTTT CTGCTTAACC 17 4 0 

AAAATACTCT GTGTATATAT TATACATATA TAAATACATG GGATTGTGTA TGTCTATATG 180 0 
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TGTTTAAAGC TT AC TATGTC TTCATTTTGG CTTCCATGAC TATCTTTTAT ACATGGAATT 186 0 

CCTTAAGATT GAGAATATGT CACTGAGTGA ATGATACCTG CAGACAGTCA GTTGATATAT 19 20 

GTAGAGTTCA GAATGACTGT TTTCTCATGT GCCTTTGGCC ATGATTCTCA ACACTGATTG 19 80 

TATAACAGAA TTTTGGGGGG AGCTTTTAAA AAATAATGAC TGAGTCTCCC ACCAGACCGA 2 040 

TTACATCATT CTCTTGTGGC GGGACCCAAG TAGAATTGCC TTTTCTTTTA AAGTTCTCCA 2100 

GATGGAGCTA ATATGCAACA AAGTTGAAAA CCACTGATCC TGGGGGTGTC TTGTTAATTT 2160 

TGAAGTAAAA GTGTACAGAA GACGTAGTGT ATGAGAAAGG GCCATTTTTA AGACAGTTAC 222 0 

CTGTTGTGCT GCTGTTACAA TATATAATGA AACCAAGTCA GGGGAGTGAA TTTATCAATC 22 80 

TTTTGATGTA AAGTAAAAAC GTAGTTCACA CTTCAGGAGA GAACTTCATA GCACAATGTC 23 40 

TTTCTATAAG ATATTTTTAA TGATTTAGTA TTTTACAACA TTTGTTTACC ATATTTTGAT 24 00 

ATACCATTTT TTTCTATCTG CCCAGTTTTA TTAAAAAAAC TATATATTAT TTTCTAAAGA 24 6 0 

AACAATCATA TTTTTATACA AAATTATGTT TTCAGGTAAC GAAATAGATG TAGGGTACAG 252 0 

TGGAACATAA GCAGTGTTAC CCCTGGCTGG GAGTCAGTAT TATACAACAA ATGGTGAGCT 2 58 0 

GGAACATGCC CTGTCTGTGC TGTCCCTCCT GTGCTGGGTC GCGGATATGT AGGCAACATT 2 64 0 

GCCTTATCAC GCTAGGTTCA CCTGACACTT TAAAAGGAAA AAAAGTTCCA TAGAGTTCTG 27 0 0 

TGGTCACAAA ATTGTTTTGC TTTTATCAAA TACTTTAATA GAACCAAAGT TGCAGATATT 27 6 0 

GGAATGTATG GAAGTATCTC AGTCTCTGCA TAAGAGGATT AAAGTATGAA AGGATCATTT 2 82 0 

AATGACTGTT TTACTTATAA GTCATTAAGT AATCCACCAT TTCTTATGGA TGATGCTTAA 28 80 

GCCTGGTGAG GTTTGTACTC TAAGGAGCCC AGATCATAAT GCAGTGCATT TCCTTAGCCC 2 94 0 

TTAGAGTTTC TTGCAAACAT TTAAAAAAAA GACATATTTA AGAAAGAAAG ATAAAGAAAA 3 00 0 

AACATATTTA ATTACTGTAA ACAGGTACTG CTTTATGTTT ATTTTCTCTC TACTTCAACC 3 0 60 

AAAATCAGAT CTTTGAGGTT TTGCTGACAT TGTTGGTGGT TTTGCACATG TTCTTTCTAA 312 0 

TTGGATTTAT GAATAGTTCT ATGGGTTTTC AAAGATGAAT CATGCTAAGA ACACTTCTGC 3180 

TTTTTGATCC ACTGTTTGCA GCAGAATTAT ATATATGTAT AGGAAAAATC CACTTTGAAT 3 24 0 

AATCCATGTT TTGTATTTGG AAATTGTTTT TAAAAATAAA AAGGAAAGGA AATATAAAAA 33 00 

AAAAAAAAAA AAAAAAAAA 3319 
(2) INFORMATION FOR SEQ ID NO : 2 0 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 264 amino acids 

( B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asp Gin Leu Glu Lys Arg Ser Arg Ala Ser Gly Ser Ser Ala Gly 
15 10 15 

Ser Met Lys His Lys Arg Leu Ser Arg His Ser Thr Ala Ser His Ser 
20 25 30 

Ser Ser His Thr Ser Gly lie Glu Ala Asp Thr Lys Pro Arg Asp Thr 
35 40 45 

Gly Pro Glu Asp Ser Tyr Ser Ser Ser Ala lie His Arg Lys Leu Lys 
50 55 60 

Thr Cys Ser Ser Met Thr Ser His Gly Ser Ser His Thr Ser Gly Val 
65 70 75 80 

Glu Ser Gly Gly Lys Asp Arg Leu Glu Glu Asp Leu Gin Asp Asp Glu 
85 90 95 

lie Glu Met Leu Val Asp Asp Pro Arg Asp Leu Glu Gin Met Asn Glu 
100 105 HO 

Glu Ser Leu Glu Val Ser Pro Asp Met Cys lie Tyr He Thr Glu Asp 
115 120 125 

Met Leu Met Ser Arg Lys Leu Asn Gly His Ser Gly Leu He Val Lys 
130 135 140 

Glu He Gly Ser Ser Thr Ser Ser Ser Ser Glu Thr Val Val Lys Leu 
145 150 155 160 

Arg Gly Gin Ser Thr Asp Ser Leu Pro Gin Thr He Cys Arg Lys Pro 
165 170 175 

Lys Thr Ser Thr Asp Arg His Ser Leu Ser Leu Asp Asp He Arg Leu 
180 185 190 

Tyr Gin Lys Asp Phe Leu Arg He Ala Gly Leu Cys Gin Asp Thr Ala 
195 200 205 

Gin Ser Tyr Thr Phe Gly Cys Gly His Glu Leu Asp Glu Glu Gly Leu 
210 215 220 

Tyr Cys Asn Ser Cys Leu Ala Gin Gin Cys He Asn He Gin Asp Ala 
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225 



230 



235 



240 



Phe Pro Val Lys Arg Thr Ser Lys Tyr Phe Ser Leu Asp Leu Thr His 
245 250 255 



Asp Glu Val Pro Glu Phe Val Val 
260 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ANATTCAAGAG AGTTGAACTG AATAACCC 29 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ANGGCAGAGCC ACATCCTTCA CAACAGAA 2 9 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



GNGTCTGAGTG TCTATGTGAG GGCAAGGA 



29 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE PE^CRTPTTON : SEQ TD NO: 24: 
TNGGATAAGCC GGCACAGACG AAGGCCAT 2 9 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GNAAGGAAGGA GACAAACACA ATCACCCA 2 9 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 : 



TNTTTGTTCTA GCAATTCTAC TTCCGGGA 



29 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TNCTTCCTCAC GTCTCTGTGC CAAATATT 2 9 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 
ANCTCCAGTCT GATGCCAGTC ATGTCTAC 29 
(2) INFORMATION FOR SEQ ID NO: 29: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " oligonulceotide " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 
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GNCGGCAACAC TCGTGAACCT TCCGAAAT 
(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GNCTTGGAACA ATTGCACTTT AAGTATTA 
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What is claimed is: 

1. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 533 to nucleotide 673; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 596 to nucleotide 673; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 1 to nucleotide 664; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bd379_l deposited under accession 
number ATCC 98361; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bd379_l deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bd379_l deposited under accession number 
ATCC 98361; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone bd379_l deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

2. A composition of claim 1 wherein said polynucleotide is operably linked 
to an expression control sequence. 
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3. A host ^transformed with a composition of 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5. A process for producing a protein, which comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; and 

(b) purifying the protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

8. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 
amino acid 44; 

(c) fragments of the amino acid sequence of SEQ ID NO:2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bd379_l deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

9. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2. 

10. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2 from amino acid 1 to amino acid 44. 

11. The composition of claim 8, further comprising a pharmaceutical^ 
acceptable carrier. 



95 



BNSDOCID <WO 9840486A2 1 > 



WO 98/40486 PCT/US98/04977 

12. A method for preventing, treating or ameliorating a medical condition 
which comprises administering to a mammalian subject a therapeutically effective amount 
of a composition of claim 11. 

13. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:l. 

14. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 55 to nucleotide 1008; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 952 to nucleotide 1008; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 403 to nucleotide 981; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bpl21_2 deposited under accession 
number ATCC 98361; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bpl21_2 deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bp!21_2 deposited under accession number 
ATCC 98361; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone bpl21_2 deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 
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(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

15. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 19 to 
amino acid 309; 

(c) fragments of the amino acid sequence of SEQ ID NO:4; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bpl21_2 deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

16. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3. 

17. A composition comprising an isolated polynucleotide selected from the 

group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 52 to nucleotide 639; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1 to nucleotide 308; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bp646_10 deposited under accession 

number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bp646_10 deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone bp646_10 deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone bp646_10 deposited under accession number ATCC 98361; 
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a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

18. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) the amino acid sequence of SEQ ID NO:6 from amino acid 1 to 
amino acid 86; 

(c) fragments of the amino acid sequence of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bp646_10 deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

19. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:5. 

20. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 7 from nucleotide 134 to nucleotide 1183; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 191 to nucleotide 1183; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1 to nucleotide 763; 



98 



BNSDOCiD: <WO 9840486A2 I > 



PCT/US98/04977 

WO 98/40486 

(e) polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cf50_l deposited under accession number 
ATCC 98361; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cf50_l deposited under accession number ATCC 98361; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone cf50_l deposited under accession number ATCC 
98361; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone cf50_l deposited under accession number ATCC 98361; 

(i) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:8; 

. .i i i^i -j „^;„„ o nrntoiti rnrnrvrisinf a fraement of the 

(j) a puiyiiutieuuuccui.v""i(, - r <~> ^ 

amino acid sequence of SEQ ID NO:8 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(j). 

21 . A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) the amino acid sequence of SEQ ID NO:8 from amino acid 1 to 
amino acid 210; 

(c) fragments of the amino acid sequence of SEQ ID NO:8; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
cf50_l deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

22. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:7. 
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23. 




composition comprising an isolated polynucleotide selected from the 





a polynucleotide comprising the nucleotide sequence of SEQ ID 



NO:9; 



(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 740 to nucleotide 2245; 



length protein coding sequence of clone cwl543_3 deposited under accession 



number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone cwl543_3 deposited under accession number ATCC 98361; 



protein coding sequence of clone cwl543_3 deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone cwl543_3 deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

24. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:10; 

(b) fragments of the amino acid sequence of SEQ ID NO:10; and 



(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 1 to nucleotide 463; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 



a polynucleotide comprising the nucleotide sequence of the mature 
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(c) the amino acid sequence encoded by the cDNA insert of clone 
cwl543_3 deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

25. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:9. 

26. A composition comprising an isolated polynucleotide selected from the 

group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:ll; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 952 to nucleotide 1074; 

thp rmrlpnfidp sentience of SEO ID 

) a puijr i iuK.jtv,ctJLMv. >_v" * -f-y — -~ 1 — 

NO:ll from nucleotide 524 to nucleotide 1059; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone da389_l deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone da389_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone da389_l deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone da389_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:12; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 12 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
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27. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) the amino acid sequence of SEQ ID NO:12 from amino acid 1 to 
amino acid 36; 

(c) fragments of the amino acid sequence of SEQ ID NO:12; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
da389_l deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

28. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:ll. 

29. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 78 to nucleotide 1619; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 604 to nucleotide 1307; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dd71_2 deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone dd71_2 deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dd71_2 deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone dd71_2 deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 14; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity; 
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(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

30. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) the amino acid sequence of SEQ ID NO:14 from amino acid 200 to 
amino acid 410; 

z„\ a.****.***^***^ fur* ommn ariH cpnnpnrp nf S*PO TO NTO:1 4: and 
VW uu B lil ^ llJ Wi ' ^ 

(d) the amino acid sequence encoded by the cDNA insert of clone 
dd71_2 deposited under accession number ATCC 98361; 
the protein being substantially free from other mammalian proteins. 

31 . An isolated gene corresponding to the cDNA sequence of SEQ ID NO:13. 

32. A composition comprising an isolated polynucleotide selected from the 

group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 1003 to nucleotide 1350; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 714 to nucleotide 1320; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dm221_l deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone dm221_l deposited under accession number ATCC 98361; 
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a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dm221_l deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone dm221_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 16 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

33. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO: 16; 

(b) the amino acid sequence of SEQ ID NO:16 from amino acid 1 to 
amino acid 106; 

(c) fragments of the amino acid sequence of SEQ ID NO:16; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
dm221_l deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

34. An isolated gene corresponding to the cDNA sequence of SEQ ID NO: 15. 

35. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 449 to nucleotide 1006; 
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(c) f^)lynucleotide comprising the nucle^R sequence of SEQ ID 
NO:17 from nucleotide 1 to nucleotide 331; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dx279_l deposited under accession 

number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone dx279_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone dx279_l deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone dx279_l deposited under accession number ATCC 98361; 

/i \ - i i j-:^4^ ^^smAinrr a nrA^in rnmnrisinp- the amino acid 
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sequence of SEQ ID NO:18; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

36. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) fragments of the amino acid sequence of SEQ ID NO:18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
dx279_l deposited under accession number ATCC 98361; 

the protein being substantially free from other mammalian proteins. 

37. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:17. 
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38. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 19 from nucleotide 74 to nucleotide 865; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 538 to nucleotide 1044; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gm243_l deposited under accession 
number ATCC 98361; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone gm243_l deposited under accession number ATCC 98361; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone gm243_l deposited under accession number 
ATCC 98361; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone gm243_l deposited under accession number ATCC 98361; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

39. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:20; 

(b) fragments of the amino acid sequence of SEQ ID NO:20; and 
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(c) t^^imino acid sequence encoded by the cDNA insert of clone 
gm243_l deposited under accession number ATCC 98361; 
the protein being substantially free from other mammalian proteins. 

40. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:19. 
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FIGURE 1A 




BamHI 2772 Cial 2666 



Plasmid name: pED6dpc2 
Plasmid size: 5374 bp 

Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
polylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et al.(1991), NAR 19: 4485-4490. 
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FIGURE IB 




Plasmid name: pNOTs 
Plasmid size: 4529 bp 



Comm.nU/Ref.r.nee,: pNOTs is a denvat.ve of pMT2 (Kaufman ef a.,1989 M o,.Ce. l .B i o..9:174 1 -,750). 
DHFR was deleted and a new polylinker was inserted between EcoHl and Hpal. M13 ongin 
of replication was inserted in the Clal site SST cDNAs are cloned between EcoRI and 
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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

This application is a continuation-in-part of Ser. No. 60/XXX,XXX (converted to 
a provisional application from non-provisional application Ser. No. 08/788,789), filed 
January 24, 1997, which is incorporated by reference herein. 

20 FIELD OF THE INVENTION 

The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

25 BACKGROUND OF THE INVENTION 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 
clone novel polynucleotides "directly" in the sense that they rely on information directly 
3 0 related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein 
in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 
sequence motif, as well as various PCR-based or low stringency hybridization cloning 
3 5 techniques, have advanced the state of the art by making available large numbers of 
DNA/amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the 
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
polynucleotides encoding them that the present invention is directed. 

40 
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SUMMARY OF THE INDENTION 



In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

5 NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 506 to nucleotide 643; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 471 to nucleotide 765; 

10 (d) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone AA35__2 deposited under accession 
number ATCC 98303; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AA35_2 deposited under accession number ATCC 98303; 

15 (f) a polynucleotide comprising the nucleotide sequence of the mature 

protein coding sequence of clone AA35_2 deposited under accession number 
ATCC 98303; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AA35_2 deposited under accession number ATCC 98303; 
2 0 (h) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 

2 5 (a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

3 0 Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

NO:l from nucleotide 506 to nucleotide 643; the nucleotide sequence of SEQ ID NO:l 
from nucleotide 471 to nucleotide 765; the nucleotide sequence of the full-length protein 
coding sequence of clone AA35_2 deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone AA35_2 deposited 
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under accession numb^^TCC 98303. In other preferred^mbodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone AA35_2 deposited under accession number ATCC 98303. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
5 comprising the amino acid sequence of SEQ ID NO:2 from amino acid 1 to amino acid 32. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:l. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
1 0 consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 



(c) fragments of the amino acid sequence of SEQ ID NO:2; and 
15 (d) the amino acid sequence encoded by the cDNA insert of clone 

AA35J2 deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence 
of SEQ ID NO:2 from amino acid 1 to amino acid 32. 
2 0 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

2 5 NO:3 from nucleotide 71 to nucleotide 736; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 113 to nucleotide 736; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 1 to nucleotide 343; 

3 0 (e) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone AM42_3 deposited under accession 

number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
,n\TA mcorf nfrlnnp AM42 3 deoosited under accession number ATCC 98303; 
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(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AM42„3 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
5 cDNA insert of clone AM42_3 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity; 
10 (k) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
15 to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 71 to nucleotide 736; the nucleotide sequence of SEQ ID NO:3 from 
nucleotide 113 to nucleotide 736; the nucleotide sequence of SEQ ID NO:3 from 
nucleotide 1 to nucleotide 343; the nucleotide sequence of the full-length protein coding 

2 0 sequence of clone AM42„3 deposited under accession number ATCC 98303; or the 

nucleotide sequence of the mature protein coding sequence of clone AM42_3 deposited 
under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone AM42_3 deposited under accession number ATCC 98303. In yet other preferred 
25 embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:4 from amino acid 1 to amino acid 91. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:3. 

In other embodiments, the present invention provides a composition comprising 

3 0 a protein, wherein said protein comprises an amino acid sequence selected from the group 

consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 to 
amino acid 91; 
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(c) fra^^nts of the amino acid sequence of SE^tD NO:4; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
AM42_3 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. Preferably such 
5 protein comprises the amino acid sequence of SEQ ID NO:4 or the amino acid sequence 
of SEQ ID NO:4 from amino acid 1 to amino acid 91. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

10 NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 55 to nucleotide 423; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BG137_7 deposited under accession 

1 5 number ATCC 98303; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BG137J7 deposited under accession number ATCC 98303; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BG137_7 deposited under accession number 

2 0 ATCC 98303; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BG137_7 deposited under accession number ATCC 98303; 

(g) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:6; 

2 5 (h) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:6 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 

3 0 of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 55 to nucleotide 423; the nucleotide sequence of the full-length 
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protein coding sequence of clone BG137_7 deposited under accession number ATCC 
98303; or the nucleotide sequence of the mature protein coding sequence of clone BG137_7 
deposited under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
5 clone BG137J7 deposited under accession number ATCC 98303. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:6 from amino acid 62 to amino acid 
123. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
10 IDNO:5. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

15 (b) the amino acid sequence of SEQ ID NO:6 from amino acid 62 to 

amino acid 123; 

(c) fragments of the amino acid sequence of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BG137_7 deposited under accession number ATCC 98303; 

2 0 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:6 or the amino acid sequence 
of SEQ ID NO:6 from amino acid 62 to amino acid 123. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

2 5 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 186 to nucleotide 2030; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:7 from nucleotide 873 to nucleotide 2030; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 802 to nucleotide 1173; 
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(e) a pc^nucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CH699_1 deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone CH699_1 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CH699 Jl deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
± o cDN A insert of clone CH699_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:8; 

a nolvnurleotide encoding a protein comprising a fragment of the 

viz — r ~ j ^ ' 

amino acid sequence of SEQ ID NO:8 having biological activity; 
3_ 5 (k) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
2 0 to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 186 to nucleotide 2030; the nucleotide sequence of SEQ ID NO:7 
from nucleotide 873 to nucleotide 2030; the nucleotide sequence of SEQ ID NO:7 from 
nucleotide 802 to nucleotide 1173; the nucleotide sequence of the full-length protein 

2 5 coding sequence of clone CH699_1 deposited under accession number ATCC 98303; or the 

nucleotide sequence of the mature protein coding sequence of clone CH699_1 deposited 
under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CH699_1 deposited under accession number ATCC 98303. In yet other preferred 

3 0 embodiments, the present invention provides a polynucleotide encoding a protein 

comprising the amino acid sequence of SEQ ID NO:8 from amino acid 218 to amino acid 
329. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
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In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) the amino acid sequence of SEQ ID NO:8 from amino acid 218 to 
amino acid 329; 

(c) fragments of the amino acid sequence of SEQ ID NO:8; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CH699_1 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:8 or the amino acid sequence 
of SEQ ID NO:8 from amino acid 218 to amino acid 329. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:10; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO.10 from nucleotide 111 to nucleotide 677; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:10 from nucleotide 156 to nucleotide 677; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone C0851_l deposited under accession 
number ATCC 98303; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone C0851_l deposited under accession number ATCC 98303; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone C0851_l deposited under accession number 
ATCC 98303; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone C0851_l deposited under accession number ATCC 98303; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:ll; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:ll having biological activity; 
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(j) a p^nucleotide which is an allelic variant ora polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

5 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:10 from nucleotide 111 to nucleotide 677; the nucleotide sequence of SEQ ID NO:10 
from nucleotide 156 to nucleotide 677; the nucleotide sequence of the full-length protein 

1 0 coding sequence of clone C0851_l deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone C0851„l deposited 
under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone C0851_l deposited under accession number ATCC 98303. In yet other preferred 

15 embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:ll from amino acid 120 to amino acid 
189. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:10, SEQ ID NO:9 or SEQ ID NO:12 . 

2 0 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:ll; 

(b) the amino acid sequence of SEQ ID NO:ll from amino acid 120 to 

25 amino acid 189; 

(c) fragments of the amino acid sequence of SEQ ID NO:ll; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
C0851_l deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. Preferably such 

3 0 protein comprises the amino acid sequence of SEQ ID NO:ll or the amino acid sequence 

of SEQ ID NO: 11 from amino acid 120 to amino acid 189. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
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(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 123 to nucleotide 755; 

5 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13 from nucleotide 279 to nucleotide 755; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 1 to nucleotide 631; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
10 length protein coding sequence of clone CP111_1 deposited under accession 

number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CP111_1 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
15 protein coding sequence of clone CP111_1 deposited under accession number 

ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CP111_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 

2 0 sequence of SEQ ID NO:14; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 14 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

25 (1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

3 0 NO:13 from nucleotide 123 to nucleotide 755; the nucleotide sequence of SEQ ID NO:13 

from nucleotide 279 to nucleotide 755; the nucleotide sequence of SEQ ID NO:13 from 
nucleotide 1 to nucleotide 631; the nucleotide sequence of the full-length protein coding 
sequence of clone CP111_1 deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone CP111JL deposited 

10 
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under accession numbe^TCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CP1H_1 deposited under accession number ATCC 98303. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
5 comprising the amino acid sequence of SEQ ID NO:14 from amino acid 1 to amino acid 
171. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:13. 

In other embodiments, the present invention provides a composition comprising 
10 a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 14; 

/w\ »v,o =.rv,iTir, aHH spnuence of SEO ID NO:14 from amino acid 1 to 
amino acid 171; 

1S ( c ) fragments of the amino acid sequence of SEQ ID NO:14; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CP1H_1 deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:14 or the amino acid sequence 
20 of SEQ ID NO:14 from amino acid 1 to amino acid 171 . 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

2 5 (b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15 from nucleotide 214 to nucleotide 2760; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 406 to nucleotide 2760; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:15 from nucleotide 2011 to nucleotide 2565; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CS278_1 deposited under accession 
number ATCC 98303; 
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(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CS278„1 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CS278_1 deposited under accession number 

5 ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CS278_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 16; 

10 (j) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:16 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
15 of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 214 to nucleotide 2760; the nucleotide sequence of SEQ ID NO:15 
2 0 from nucleotide 406 to nucleotide 2760; the nucleotide sequence of SEQ ID NO:15 from 
nucleotide 2011 to nucleotide 2565; the nucleotide sequence of the full-length protein 
coding sequence of clone CS278_1 deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone CS278_1 deposited 
under accession number ATCC 98303. In other preferred embodiments, the 

2 5 polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

clone CS278_1 deposited under accession number ATCC 98303. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO: 16 from amino acid 596 to amino acid 
784. 

3 0 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

ID NO:15. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

12 
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(a) the amino acid sequence of SEQ ID NO: 16; 

(b) the amino acid sequence of SEQ ID NO: 16 from amino acid 596 to 
amino acid 784; 

(c) fragments of the amino acid sequence of SEQ ID NO: 16; and 

5 (d) the amino acid sequence encoded by the cDNA insert of clone 

CS278.J deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:16 or the amino acid sequence 
of SEQ ID NO:16 from amino acid 596 to amino acid 784. 
10 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
15 NO:17 from nucleotide 901 to nucleotide 1074; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 17 from nucleotide 970 to nucleotide 1074; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 626 to nucleotide 1147; 

2 0 (e) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone DF968_3 deposited under accession 
number ATCC 98303; 

(£) a polynucleotide encoding the full-length protein encoded by the 
cDN A insert of clone DF968_3 deposited under accession number ATCC 98303; 
25 ( g ) a polynucleotide comprising the nucleotide sequence of the mature 

protein coding sequence of clone DF968.3 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone DF968J3 deposited under accession number ATCC 98303; 

3 0 (i) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:18; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity; 



13 



BNSDOC1D <WO 9832853A2 I > 



WO 98/32853 PCT/US98/01396 

(k) a polynucleotide which is an allelic Affiant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species hornologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 901 to nucleotide 1074; the nucleotide sequence of SEQ ID NO:17 
from nucleotide 970 to nucleotide 1074; the nucleotide sequence of SEQ ID NO:17 from 
nucleotide 626 to nucleotide 1147; the nucleotide sequence of the full-length protein 
coding sequence of clone DF968_3 deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone DF968_3 deposited 
under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone DF968_3 deposited under accession number ATCC 98303. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:17. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 18; 

(b) fragments of the amino acid sequence of SEQ ID NO: 18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
DF968_3 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 18. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 560 to nucleotide 820; 
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(c) a potynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone DN1120__2 deposited under accession 
number ATCC 98303; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone DN1120J2 deposited under accession number ATCC 98303; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone DN1120_2 deposited under accession number 
ATCC 98303; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone DN1120_2 deposited under accession number ATCC 98303; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(M a nolvnucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 

of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 560 to nucleotide 820; the nucleotide sequence of the full-length 
protein coding sequence of clone DN1120J2 deposited under accession number ATCC 
98303; or the nucleotide sequence of the mature protein coding sequence of clone 
DN1120_2 deposited under accession number ATCC 98303. In other preferred 
embodiments, the polynucleotide encodes the full-length or mature protein encoded by 
the cDNA insert of clone DN1120„2 deposited under accession number ATCC 98303. In 
yet other preferred embodiments, the present invention provides a polynucleotide 
encoding a protein comprising the amino acid sequence of SEQ ID NO:20 from amino acid 
1 to amino acid 61. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:19. 
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mbodiments, the present invention provides a 



In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:20; 

(b) the amino acid sequence of SEQ ID NO:20 from amino acid 1 to 
amino acid 61; 

(c) fragments of the amino acid sequence of SEQ ID NO:20; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
DN1120_2 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:20 or the amino acid sequence 
of SEQ ID NO:20 from amino acid 1 to amino acid 61. 

In one embodiment the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:21; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 649 to nucleotide 786; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 736 to nucleotide 786; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 525 to nucleotide 787; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone D0589_l deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone D0589_l deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone D0589__l deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone D0589_l deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:22; 
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(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:22 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

5 (1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
1 0 NO:21 from nucleotide 649 to nucleotide 786; the nucleotide sequence of SEQ ID NO:21 
from nucleotide 736 to nucleotide 786; the nucleotide sequence of SEQ ID NO:21 from 
nucleotide 525 to nucleotide 787; the nucleotide sequence of the full-length protein coding 
senuence of clone D0589 _1 deposited under accession number ATCC 98303; or the 
nucleotide sequence of the mature protein coding sequence of clone D0589_l deposited 
15 under accession number ATCC 98303. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone D0589JL deposited under accession number ATCC 98303. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:21. 

2 0 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:22; 

(b) fragments of the amino acid sequence of SEQ ID NO:22; and 

2 5 (c) the amino acid sequence encoded by the cDNA insert of clone 

D0589_l deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:22. 

In certain preferred embodiments, the polynucleotide is operably linked to an 

3 0 expression control sequence. The invention also provides a host cell, including bacterial, 

yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 
Also provided by the present invention are organisms that have enhanced, reduced, or 
modified expression of the gene(s) corresponding to the polynucleotide sequences 
disclosed herein. 
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Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

5 The protein produced according to such methods is also provided by the present 
invention. Preferred embodiments include those in which the protein produced by such 
process is a mature form of the protein. 

Protein compositions of the present invention may further comprise a 
pharmaceutically acceptable carrier. Compositions comprising an antibody which 
1 0 specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1 A and IB are schematic representations of the pED6 and pNOTs vectors, 
respectively, used for deposit of clones disclosed herein. 



2 0 DETAILED DESCRIPTION 

ISOLATED PROTEINS AND POLYNUCLEOTIDES 

Nucleotide and amino acid sequences, as presently determined, are reported 
below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 

2 5 in accordance with known methods. The predicted amino acid sequence (both full-length 

and mature) can then be determined from such nucleotide sequence. The amino acid 
sequence of the protein encoded by a particular clone can also be determined by 
expression of the clone in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants have identified what they have 

3 0 determined to be the reading frame best identifiable with sequence information available 

at the time of filing. 

As used herein a "secreted" protein is one which, when expressed in a suitable host 
cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
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proteins secreted wholly soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

5 rione"AA35 2" 

A polynucleotide of the present invention has been identified as clone "AA35_2". 
AA35_2 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
1 0 analysis of the amino acid sequence of the encoded protein. AA35_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"AA35_2 protein"). 

The nucleotide sequence of AA35 2 as presently determined is reported in SEQ 
ID NO:l. What applicants presently believe to be the proper reading frame and the 
15 predicted amino acid sequence of the AA35_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:2. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
AA35_2 should be approximately 1400 bp. 

The nucleotide sequence disclosed herein for AA35_2 was searched against the 
20 GenBank and GeneSeq nucleotide sequence databases using BLASTN / BLASTX and 
FASTA search protocols. AA35_2 demonstrated at least some similarity with sequences 
identified as C16789 (Human placenta cDNA 5-end GEN-529D11), H23653 (yn72e01.rl 
Homo sapiens cDNA clone 173976 5' similar to contains Alu repetitive element), L31848 
(Homo sapiens serine /threonine kinase receptor 2 (SKR2) gene, 3 alternative splices, 3' 

2 5 ends), U40455 (Human chromosome X cosmid, clones 196B12, 9H11 and 43H9, repeat 

units and sequence tagged sites), and Z82197 (Human DNA sequence from clone J293L6). 
The predicted amino acid sequence disclosed herein for AA35_2 was searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted AA35_2 protein demonstrated at least some similarity to sequences 

3 0 identified as U58658 (unknown [Homo sapiens]) and X55777 (put. ORF [Homo sapiens]). 

Based upon sequence similarity, AA35_2 proteins and each similar protein or peptide may 
share at least some activity. The nucleotide sequence of AA35_2 indicates that it may 
contain an Alu repetitive element. 



19 



BNSDOCID <WO 9832853A2 I > 



WO 98/32853 PCT/US98/01396 
Clone "AM^3 



A polynucleotide of the present invention has been identified as clone M AM42_3 M . 
AM42_3 was isolated from a human fetal kidney cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
5 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. AM42_3 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"AM42J3 protein"). 

The nucleotide sequence of AM42_3 as presently determined is reported in SEQ 

10 ID NO:3. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the AM42_3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:4. Amino acids 2 to 14 are a predicted 
leader /signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 15, or are a transmembrane domain. 

1 5 The EcoRI / NotI restriction fragment obtainable from the deposit containing clone 

AM42_3 should be approximately 1400 bp. 

The nucleotide sequence disclosed herein for AM42_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. AM42_3 demonstrated at least some similarity with sequences 

2 0 identified as AA109637 (mm01f02.rl Stratagene mouse kidney (#937315) Mus musculus 
cDNA clone 520251 5'), AA131170 (zo08e05.sl Stratagene neuroepithelium NT2RAMI 
937234 Homo sapiens cDNA clone 567104 3'), AA131483 (zo08e05.rl Stratagene 
neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 567104 5'), and AA445683 
(vf62h07.rl Barstead MPLRB1 Mus musculus cDNA clone 848413 5'). Based upon 

25 sequence similarity, AM42„3 proteins and each similar protein or peptide may share at 
least some activity. The TopPredll computer program predicts a potential transmembrane 
domain within the AM42_3 protein sequence centered around amino acid 152 of SEQ ID 
NO:4. 



30 Clone "BG137 7" 

A polynucleotide of the present invention has been identified as clone "BG137_7". 
BG137_7 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
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analysis of the amino acid sequence of the encoded protein. BG\W_7 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as n BG137„7 protein"). 

The nucleotide sequence of BG137_7 as presently determined is reported in SEQ 
5 ID NO:5. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the BG137J7 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:6. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
BG137_7 should be approximately 500 bp. 
1 0 The nucleotide sequence disclosed herein for BG 137J7 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. BG137J7 demonstrated at least some similarity with sequences 
iA~~nt;«A *c nR7£R^ miiman mRNA for KIAA0243 eene, partial cds). Based upon 

1UC1 111 JL1\-N-A * ' vw — \~ «^ 

sequence similarity, BG137_7 proteins and each similar protein or peptide may share at 
1 5 least some activity . 

Clone "CH699 1" 

A polynucleotide of the present invention has been identified as clone "CH699_1". 
CH699_1 was isolated from a human fetal kidney cDNA library using methods which are 
2 0 selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CH699_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "CH699_1 protein"). 

2 5 The nucleotide sequence of CH699_1 as presently determined is reported in SEQ 

ID NO:7- What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CH699_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:8. Amino acids 217 to 229 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 

3 0 amino acid 230, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CH699_1 should be approximately 2000 bp. 

The nucleotide sequence disclosed herein for CH699_1 was searched against the 
p^ph nucleotide seauence databases using BLASTN/BLASTX and 

NJtill/Ull" ' ^- ~ * ' 1 J. ~ 
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FASTA search protocols. CH699_1 demonstrated at least some similarity with sequences 
identified as AA155014 (mr99h05.rl Stratagene mouse embryonic carcinoma (#937317) 
Mus musculus cDNA clone 605625 5'), AA423476 (ve76d07.rl Soares mouse mammary 
gland NbMMG Mus musculus cDNA clone 832141 5'), U79271 (Human clones 23920 and 
5 23921 mRNA sequence), and W72147 (zd70f08.sl Soares fetal heart NbHH19W Homo 
sapiens cDNA clone 346023 3'). The predicted amino acid sequence disclosed herein for 
CH699_1 was searched against the GenPept and GeneSeq amino acid sequence databases 
using the BLASTX search protocol. The predicted CH699_1 protein demonstrated at least 
some similarity to sequences identified as X51591 (beta-myosin heavy chain [Homo 
1 0 sapiens]). Based upon sequence similarity, CH699_1 proteins and each similar protein or 
peptide may share at least some activity. 



A polynucleotide of the present invention has been identified as clone "C0851_1 M . 
15 C0851_l was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 



clone, including the entire coding sequence of a secreted protein (also referred to herein 

2 0 as "C0851__l protein"). 

The nucleotide sequence of the 5' portion of C0851_l as presently determined is 
reported in SEQ ID NO:9. An additional internal nucleotide sequence from C0851_l as 
presently determined is reported in SEQ ID NO:10. What applicants believe is the proper 
reading frame and the predicted amino acid sequence encoded by such internal sequence 

2 5 is reported in SEQ ID NO:ll. Amino acids 3 to 15 of SEQ ID NO:ll are a predicted 
leader /signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 16, or are a transmembrane domain. Additional nucleotide sequence from the 
3* portion of C0851_l, including the polyA tail, is reported in SEQ ID NO:12. 



3 0 C0851_l should be approximately 1800 bp. 

The nucleotide sequence disclosed herein for C0851_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/ BLASTX and 
FASTA search protocols. C0851_l demonstrated at least some similarity with sequences 
identified as AA132585 (zo20c04.rl Stratagene colon (#937204) Homo sapiens cDNA clone 



Clone "CQ851 1" 



analysis of the amino acid sequence of the encoded protein. C0851_l is a full-length 



The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
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Soares mouse embryo NbME13.5 14.5 Mus musculus cDNA clone 354136 5'), and X92871 
(X.laevis mRNA for an unknown transmembrane protein). The predicted amino acid 
sequence disclosed herein for C0851_l was searched against the GenPept and GeneSeq 
5 amino acid sequence databases using the BLASTX search protocol. The predicted 
C0851_l protein demonstrated at least some similarity to sequences identified as X92871 
(unknown transmembrane protein [Xenopus laevis]). Based upon sequence similarity, 
C0851_l proteins and each similar protein or peptide may share at least some activity. 
The nucleotide sequence of C0851_l indicates that it may contain an Alu repetitive 
1 0 element. 

Clone "CP111 1" 

Fiashppn identified as clone "CP111_1". 

/\ pUiyiLUA-lCWlieit. ui^^i-ww^-... 

CP111_1 was isolated from a human adult salivary gland cDNA library using methods 
1 5 which are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or 
was identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CP111_1 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"CP111_1 protein"). 

20 The nucleotide sequence of CPU 1_1 as presently determined is reported in SEQ 

ID NO:13. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CPlll.l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:14. Amino acids 40 to 52 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 

2 5 amino acid 53, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CP111_1 should be approximately 3200 bp. 

The nucleotide sequence disclosed herein for CP111_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/ BLASTX and 

3 0 FASTA search protocols. CPU 1_1 demonstrated at least some similarity with sequences 

identified as T53688 (ya98g07.rl Homo sapiens cDNA clone 69756 5) and W70295 
(zd58f03.sl Soares fetal heart NbHH19W Homo sapiens cDNA clone 344861 3 ). The 
predicted amino acid sequence disclosed herein for CPlll.l was searched against the 

rlatehasp.s using" the BLASTX search protocol. 
Vjeiu'epi anu ucuc^ctj cum* ^ — - - ^ 
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The predicted CPni_l protein demonstrated at least some similarity to sequences 
identified as X88852 (env protein [Primate T-cell lymphotropic]). Based upon sequence 
similarity, CP111_1 proteins and each similar protein or peptide may share at least some 
activity. The TopPredll computer program predicts a potential transmembrane domain 
within the CP111_1 protein sequence centered around amino acid 50 of SEQ ID NO:14. 

Clone "CS278 1" 

A polynucleotide of the present invention has been identified as clone "CS278_1". 
CS278_1 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CS278__1 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
M CS278_1 protein"). 

The nucleotide sequence of CS278_1 as presently determined is reported in SEQ 
ID NO:15. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CS278_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO: 16. Amino acids 52 to 64 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 65, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CS278_1 should be approximately 4400 bp. 

The nucleotide sequence disclosed herein for CS278_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CS278_1 demonstrated at least some similarity with sequences 
identified as AA234319 (zr66c07.rl Soares NhHMPu SI Homo sapiens cDNA clone 668364 
5'), H44192 (yo73f09.rl Homo sapiens cDNA clone 183593 5'), W18258 (mb86all.rl Soares 
mouse p3NMF19), X76589 (H.sapiens DNA 3' flanking simple sequence region clone 
wg2c3), and Z74652 (M.musculus mRNA; expressed sequence tag (tcc2)). The predicted 
amino acid sequence disclosed herein for CS278_1 was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
predicted CS278_1 protein demonstrated at least some similarity to sequences identified 
as M34651 (ORF-3 protein [Suid herpesvirus 1]). The predicted CS278_1 protein also 
demonstrated at least some similarity to a protein motif, cytochrome P450 cysteine heme- 
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iron hgand signature. BaseWpon sequence similarity, CS278_1 pro" and each similar 
protein or peptide may share at least some activity. The TopPredll computer program 
predicts five potential transmembrane domains within the CS278 JL protein sequence, 
which are centered around amino acids 75, 160, 525, 610, and 700 of SEQ ID NO:16, 
respectively. The nucleotide sequence of CS278_1 may contain GAA simple repeat 
elements. 



rionp"DF968 3" 

A polynucleotide of the present invention has been identified as clone "DF968_3". 
1 0 DF968 3 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

, . r oo^onr* nf thp encoded protein. DF968_3 is a full-length 

analysis ui uitr amuiu ^^~> — — ~- — 

clone, including the entire coding sequence of a secreted protein (also referred to herein 

15 as "DF968_3 protein"). 

The nucleotide sequence of DF968J3 as presently determined is reported in SEQ 
ID NO:17. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the DF968_3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:18. Amino acids 11 to 23 are a predicted 

2 0 leader /signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 24, or are a transmembrane domain. Another possible DF968_3 reading frame 
and predicted amino acid sequence is encoded by basepairs 191 to 430 of SEQ ID NO:17 

and is reported in SEQ ID NO:33. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

2 5 DF968_3 should be approximately 1010 bp. 

The nucleotide sequence disclosed herein for DF968J3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN / BL ASTX and 
FASTA search protocols. DF968_3 demonstrated at least some similarity with sequences 
identified as AA426010 ( Z w49el2.sl Soares total fetus Nb2HF8 9w Homo sapiens cDNA 

3 0 clone 773422 3' similar to contains element LTR5 repetitive element), H18256 (yn48a04.rl 

Homo sapiens cDNA clone 171630 5'), and T06820 (EST04709 Homo sapiens cDNA clone 
HFBDZ29). The predicted amino acid sequence disclosed herein for DF968_3 was 
searched against the GenPept and GeneSeq amino acid sequence databases using the 
~ T . r.™ u n P r,r P HirtpH DF968 3 protein demonstrated at least some 

DLrtOlA SCtllUl piuiuv.ui. ' I — » 
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similarity to sequences identified as Z38125 (orf, len 112, CATO.07). Based upon sequence 
similarity, DF968_3 proteins and each similar protein or peptide may share at least some 
activity. The nucleotide sequence of DF968_3 indicates that it may contain repeat 



Clone M DN1120 2" 

A polynucleotide of the present invention has been identified as clone "DN1120_2". 
DN1120_2 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. DN1120_2 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "DN1120_2 protein"). 

The nucleotide sequence of DN1120_2 as presently determined is reported in SEQ 
ID NO:19. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the DN1120_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:20. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
DN1120_2 should be approximately 1000 bp. 

The nucleotide sequence disclosed herein for DN1120_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. DN1120_2 demonstrated at least some similarity with sequences 
identified as M62256 (EST00323 Homo sapiens cDNA clone HHCH15 similar to Alu 
repetitive element), M78991 (EST01139 Homo sapiens cDNA clone HHCPG39), Q59179 
(Human brain Expressed Sequence Tag EST00323), and Q61084 (Human brain Expressed 
Sequence Tag EST01139). Based upon sequence similarity, DN1120_2 proteins and each 
similar protein or peptide may share at least some activity. 

Clone "DQ589 1" 

A polynucleotide of the present invention has been identified as clone "D0589_1 M . 
D0589__l was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. D0589_l is a full-length 



sequences. 
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clone, including the entire^Bing sequence of a secreted protein (aW -eferred to herein 

as "D0589_l protein"). 

The nucleotide sequence of D0589_l as presently determined is reported in SEQ 
ID NO:21. What applicants presently believe to be the proper reading frame and the 
5 predicted amino acid sequence of the D0589_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:22. Amino acids 17 to 29 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 30, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
1 0 D0589_l should be approximately 1800 bp. 

The nucleotide sequence disclosed herein for D0589_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN / BLASTX and 
FASTA search protocols. D0589 1 demonstrated at least some similarity with sequences 
identified as AA402420 (zu47e04.sl Soares ovary tumor NbHOT Homo sapiens cDNA 
1 5 clone 741150 3'), AA426621 (zw03a09.rl Soares NhHMPu SI Homo sapiens cDNA clone 
768184 5'), AA436749 (zv67cl0.rl Soares total fetus Nb2HF8 9w Homo sapiens cDNA 
clone 758706 5'), H12845 (yjl4h06.rl Homo sapiens cDNA clone 148763 5'), R42350 
(yg01b05.sl Homo sapiens cDNA clone 30909 3'), W02775 (zc65g07.sl Soares fetal heart 
NbHH19W Homo sapiens cDNA clone 327228 3'), W24833 (zc65g07.rl Soares fetal heart), 
2 0 W58173 (zdl9f02.sl Soares fetal heart NbHH19W Homo sapiens cDNA clone 3411 15 3' 
similar to contains Alu repetitive element;contains element LI repetitive element), and 
Z82201 (Human DNA sequence from clone J345P10). Based upon sequence similarity, 
D0589_l proteins and each similar protein or peptide may share at least some activity. 



25 Deposit of Clones 

Clones AA35_2, AM42_3, BG137_7, CH699_1, C0851_l, CP111_1, CS278_1, 
DF968_3, DN1120_2, and DC689_1 were deposited on January 23, 1997 with the American 
Type Culture Collection as an original deposit under the Budapest Treaty and were given 
the accession number ATCC 98303, from which each clone comprising a particular 

3 0 polynucleotide is obtainable. All restrictions on the availability to the public of the 
deposited material will be irrevocably removed upon the granting of the patent, except 
for the requirements specified in 37 C.F.R. § 1.808(b). 

Each clone has been transfected into separate bacterial cells (E. coli) in this 
™mr>™tP dpnosit. Each clone can be removed from the vector in which it was deposited 

' 1 " "I 
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by performing ar^tcoRI/Notl digestion (5 1 site, EcoRI; ^site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pNOTs vector depicted in Fig. 1. The pED6dpc2 vector ("pED6") was derived from 
pED6dpcl by insertion of a new polylinker to facilitate cDNA cloning (Kaufman et ah, 
5 1991, Nucleic Acids Res. 19: 4485-4490); the pNOTs vector was derived from pMT2 
(Kaufman et aL, 1989, Mol Cell Biol 9: 946-958) by deletion of the DHFR sequences, 
insertion of a new polylinker, and insertion of the M13 origin of replication in the Clal site. 
In some instances, the deposited clone can become "flipped" (i.e., in the reverse 
orientation) in the deposited isolate. In such instances, the cDNA insert can still be 

10 isolated by digestion with EcoRI and NotI. However, NotI will then produce the 5' site 
and EcoRI will produce the 3* site for placement of the cDNA in proper orientation for 
expression in a suitable vector. The cDNA may also be expressed from the vectors in 
which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 

1 5 deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of the 
oligonucleotide probe that was used to isolate each full-length clone is identified below, 

2 0 and should be most reliable in isolating the clone of interest. 



Clone 


Probe Sequence 


AA35_2 


SEQ ID NO:23 


AM42_3 


SEQ ID NO:24 


BG137_7 


SEQ ID NO:25 


CH699_1 


• SEQIDNO:26 


C0851_l 


SEQ ID NO:27 


CP111_1 


SEQ ID NO:28 


CS278_1 


SEQ ID NO:29 


DF968_3 


SEQ ID NO:30 


DN1120_2 


SEQ ID NO:31 


D0589_l 


SEQ ID NO:32 
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In the sequences listed ab^lwhich include an N at position 2, that^osition is occupied 
in preferred probes/primers by a biotinylated phosphoaramidite residue rather than a 
nucleotide (such as , for example, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-Z-CN-biotinyl^-aminobutyO-propyl-S-O^Z-cyanoethylJ^N^- 

5 diisopropyl)-phosphoramadite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 

1 o (b) It should be designed to have a T m of approx. 80 ° C (assuming 2° for each 

A or T and 4 degrees for each G or C). 
The oligonucleotide should preferably be labeled with g- 32 P ATP (specific activity 6000 
r-; ar.H TA nnlvnnrleotide kinase using commonly employed techniques for 

labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 

1 5 label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably 

20 be thawed and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 ug/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 

2 5 colonies on solid bacteriological media containing L-broth containing ampicillin at 100 

ug/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 
colonies to nitrocellulose filters and lyse, denature and bake them. 

3 0 The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 

6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate /liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 ug/ml of yeast RNA, and 10 mM EDTA (approximately 
10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 
a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 
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incubated at 65 °C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 
5 filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 
using standard procedures. The clones can then be verified by restriction analysis, 

1 0 hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 
as described in H.U. Saragovi, et ah, Bio /Technology 10, 773-778 (1992) and in R.S. 

1 5 McDowell, et al.,J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 
herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 
the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 

2 0 could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 

The present invention also provides both full-length and mature forms of the 
disclosed proteins. The full-length form of the such proteins is identified in the sequence 

2 5 listing by translation of the nucleotide sequence of each disclosed clone. The mature form 

of such protein may be obtained by expression of the disclosed full-length polynucleotide 
(preferably those deposited with ATCC) in a suitable mammalian cell or other host cell. 
The sequence of the mature form of the protein may also be determinable from the amino 
acid sequence of the full-length form. 

3 0 The present invention also provides genes corresponding to the polynucleotide 

sequences disclosed herein. "Corresponding genes" are the regions of the genome that 
are transcribed to produce the mRNAs from which cDNA polynucleotide sequences are 
derived and may include contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore include but are not limited 
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to coding sequences, 5' anW untranslated regions, alternatively spiced exons, introns, 
promoters, enhancers, and silencer or suppressor elements. The corresponding genes can 
be isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include the preparation of probes or primers from the disclosed 
5 sequence information for identification and /or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An "isolated gene" is a gene that 
has been separated from the adjacent coding sequences, if any, present in the genome of 
the organism from which the gene was isolated. 

Organisms that have enhanced, reduced, or modified expression of the gene(s) 
10 corresponding to the polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through the use of antisense 
polynucleotides or ribozymes that bind and/or cleave the mRNA transcribed from the 
/aii .. ^ A/!„~i c ^QQA Trends Pharmacol. Sci. 15(7): 250-254; Lavarosky et al., \997, 

gene will* i'^ixw, - - 

Biochem. Mol. Med. 62(1): 11-22; and Hampel, 1998, Prog. Nucleic Acid Res. Mol. Biol. 58: 1- 
15 39; all of which are incorporated by reference herein). Transgenic animals that have 
multiple copies of the gene(s) corresponding to the polynucleotide sequences disclosed 
herein, preferably produced by transformation of cells with genetic constructs that are 
stably maintained within the transformed cells and their progeny, are provided. 
Transgenic animals that have modified genetic control regions that increase or reduce 
2 0 gene expression levels, or that change temporal or spatial patterns of gene expression, are 
also provided (see European Patent No. 0 649 464 Bl, incorporated by reference herein). 
In addition, organisms are provided in which the gene(s) corresponding to the 
polynucleotide sequences disclosed herein have been partially or completely inactivated, 
through insertion of extraneous sequences into the corresponding gene(s) or through 

2 5 deletion of all or part of the corresponding gene(s). Partial or complete gene inactivation 

can be accomplished through insertion, preferably followed by imprecise excision, of 
transposable elements (Plasterk, 1992, Bioessays 14(9): 629-633; Zwaal et al, 1993, Proc. Natl. 
Acad. Sci. USA 90(16): 7431-7435; Clark et al, 1994, Proc. Natl. Acad. Sci. USA 91(2): 719-722; 
all of which are incorporated by reference herein), or through homologous recombination, 

3 0 preferably detected by positive/negative genetic selection strategies (Mansour et al., 1988, 

Nature 336: 348-352; U.S. Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 5,614, 396; 
5,616,491; and 5,679,523; all of which are incorporated by reference herein). These 
organisms with altered gene expression are preferably eukaryotes and more preferably 
~~~^c ^ro-^icmc; are useful for the development of non-human models for 
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the study of disorders involving the corresponding gene(s), and for the development of 
assay systems for the identification of molecules that interact with the protein product(s) 
of the corresponding gene(s). 

Where the protein of the present invention is membrane-bound (e.g., is a receptor), 
5 the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
transmembrane domains of proteins of the invention can be identified in accordance with 
known techniques for determination of such domains from sequence information. 

1 0 Proteins and protein fragments of the present invention include proteins with 

amino acid sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 
identity) with that disclosed protein, where sequence identity is determined by comparing 

1 5 the amino acid sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 
shares at least 75% sequence identity (more preferably, at least 85% identity; most 

2 0 preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

Species homologs of the disclosed polynucleotides and proteins are also provided 
by the present invention. As used herein, a "species homologue" is a protein or 
polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 

2 5 polynucleotide, as determined by those of skill in the art. Species homologs may be 

isolated and identified by making suitable probes or primers from the sequences provided 
herein and screening a suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 

3 0 which also encode proteins which are identical, homologous, or related to that encoded 

by the polynucleotides . 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 
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The present inventio^ilso includes polynucleotides capable^Wiybridizing under 
reduced stringency conditions, more preferably stringent conditions, and most preferably 
highly stringent conditions, to polynucleotides described herein. Examples of stringency 
conditions are shown in the table below: highly stringent conditions are those that are at 
least as stringent as, for example, conditions A-F; stringent conditions are at least as 
stringent as, for example, conditions G-L; and reduced stringency conditions are at least 
as stringent as, for example, conditions M-R. 



Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
(bp)* 


Hybridization Temperature and 
Buffer* 


Wash 

Temperature 
and Buffer' 


A 


DNA:DNA 


* 50 


65 °C; lxSSC-or- 

42°C; lxSSC, 50% formamide 


65 n C; 0.3xSSC 


B 


DNAiDNA 


<50 


T B *; IxSSC 


T B *; IxSSC 


C 


DNA:RNA 


2 50 


67°C; IxSSC -or- 

45 U C; lxSSC, 50% formamide 


67 °C; 0.3xSSC 


D 


DNA:RNA 


<50 


T D *; IxSSC 


T D *; IxSSC 


E 


RNA:RNA 


> 50 


70°C; lxSSC-or- 

50 °C; IxSSC, 50% formamide 


70 °C; 0.3xSSC 


F 


RNA:RNA 


<50 


T F *; IxSSC 


T F *; IxSSC 


G 


DNA:DNA 


z 50 


65 °C; 4xSSC -or- 

42°C; 4xSSC, 50% formamide 


65°C; IxSSC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA:RNA 


a 50 


67 °C; 4xSSC -or- 

45 °C; 4xSSC, 50% formamide 


67°C; IxSSC 


J 


DNA:RNA 


<50 


T,*; 4xSSC 


T,*; 4xSSC 


K 


RNA:RNA 


* 50 


70 °C; 4xSSC -or- 

50°C; 4xSSC, 50% formamide 


67°C; IxSSC 


L 


RNAiRNA 


<50 


V; 2xSSC 


T L *; 2xSSC 


M 


DNA:DNA 


2 50 


50 °C; 4xSSC-or- 

40°C; 6xSSC, 50% formamide 


50°C; 2xSSC 


N 


DNArDNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


O 


DNA:RNA 


^ 50 


55 °C; 4xSSC-or- 

42°C; 6xSSC, 50% formamide 


55°C; 2xSSC 


V 


DNA:RNA 


<50 


T P *; 6xSSC 


T, *; 6xSSC 


Q 


RNA:RNA 


2 50 


60°C;4xSSC -or- 

45 °C; 6xSSC, 50% formamide 


60°C; 2xSSC 


R 


RNA:RNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



*• The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
r * nnl^nrl^riHp to a tareet oolvnucleotide of unknown sequence, the hybrid length is assumed 
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to be that of the hybridSng polynucleotide. When polynucleotides of known sequence are hybridized, the 
hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 
or regions of optimal sequence complementarity. 

f : SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 P0 4 , and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
5 (lxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

*Tb - T R : The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 
be 5-10°C less than the melting temperature (TJ of the hybrid, where T m is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T m (°C) - 2(# of A + T bases) + 4(# of G + 
10 C bases). For hybrids between 18 and 49 base pairs in length, T m (°C) = 81.5 + 16.6(log 10 [Na + ]) + 0.41 (%G+C) - 
(600 /N), where N is the number of bases in the hybrid, and [Na'] is the concentration of sodium ions in the 
hybridization buffer ([Na + ] for lxSSC = 0.165 M). 

Additional examples of stringency conditions for polynucleotide hybridization are 
15 provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et al., eds., 
John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
2 0 25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 

2 5 polynucleotides when aligned so as to maximize overlap and identity while minimizing 

sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 

3 0 recombinantly. Many suitable expression control sequences are known in the art. General 

methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 
linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
3 5 by a host cell which has been transformed (transfected) with the ligated 
polynucleotide /expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 
protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 
Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
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cells, 3T3 cells, CV-1 cells,^Rer transformed primate cell lines, nonlal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such 
5 as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cereoisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia calx, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
strain capable of expressing heterologous proteins. If the protein is made in yeast or 
10 bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylate of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 

1.1 .^j, 

The protein may also be produced by operably linking the isolated polynucleotide 
15 of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
2 0 Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting 

2 5 expressed protein may then be purified from such culture (i.e., from culture medium or 

cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 
containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 

3 0 Sepharose®; one or more steps involving hydrophobic interaction chromatography using 

such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which 
f a ««* a »e m.rifiratinn. For example, it may be expressed as a fusion protein, such as 

TT * Ai ^ r 
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those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 



5 subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
1 0 all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
with the present invention as an "isolated protein." 



1 5 animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

The protein may also be produced by known conventional chemical synthesis. 
Methods for constructing the proteins of the present invention by synthetic means are 
2 0 known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and /or conformational 
characteristics with proteins may possess biological properties in common therewith, 
including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 

2 5 compounds and in immunological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modifications 

3 0 of interest in the protein sequences may include the alteration, substitution, replacement, 

insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 
amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 



InVitrogen, respectively. The protein can also be tagged with an epitope and 



The protein of the invention may also be expressed as a product of transgenic 
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(see, e.g., U.S. Patent No. 4^^584). Preferably, such alteration, subsfrmtion, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and may thus be useful for screening 
5 or other immunological methodologies may also be easily made by those skilled in the art 
given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 

f TSFS AND BIOLOG ICAL ACTIVITY 

1 0 The polynucleotides and proteins of the present invention are expected to exhibit 

one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 

15 or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express 
20 recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 

2 5 with endogenous DNA sequences in patients to identify potential genetic disorders; as 

probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 
known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 

3 0 examination of expression patterns; to raise anti-protein antibodies using DNA 

immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
nnlvnnrleotide can also be used in interaction trap assays (such as, for example, that 
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described in Gyuris et aL, Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding 
interaction. 

The proteins provided by the present invention can similarly be used in assay to 
determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, 
J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 
Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
capsules. In the case of microorganisms, the protein or polynucleotide of the invention 
can be added to the medium in or on which the microorganism is cultured. 
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C ytokine and Cell^roliferahon/ Dif ferentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 
inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 
induce production of other cytokines in certain cell populations. Many protein factors 
5 discovered to date, including all known cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of a protein of the present invention is 
evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, 
10 MC9/G, M + (preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, among other means, be measured 

pthods: 

Assays for T-cell or thymocyte proliferation include without limitation those 
1 5 described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 
Marguhes, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
2 0 133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J. 
Immunol. 152: 1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node 
cells or thymocytes include, without limitation, those described in: Polyclonal T cell 
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a. 
25 Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
3 0 Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 
Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 
336:690-692, 1988; Greenberger et al, Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
surement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
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Immunology. J.E.e.a^oligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wfley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 
Interleukin 11 - Bennett, F., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 
in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
5 Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K.J. In Current Protocols in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 

10 proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., 

15 Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et aL, Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 
140:508-512, 1988. 



Immune Stimulating or Suppressing Activity 
2 0 A protein of the present invention may also exhibit immune stimulating or 

immune suppressing activity, including without limitation the activities for which assays 
are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., 
in regulating (up or down) growth and proliferation of T and /or B lymphocytes, as well 

2 5 as effecting the cytolytic activity of NK cells and other cell populations. These immune 

deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 
infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 

3 0 herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections 

such as candidiasis. Of course, in this regard, a protein of the present invention may also 
be useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 
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Autoimmune disoBrs which may be treated using a prSein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
5 myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 
Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
example, organ transplantation), may also be treatable using a protein of the present 
10 invention. 

Using the proteins of the invention it may also be possible to immune responses, 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
™,«mnn.«w» alreadv in progress or may involve preventing the induction of an 

AAA Hi' w»-». ' \p J A ' 

immune response. The functions of activated T cells may be inhibited by suppressing T 
1 5 cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves 
inducing non-responsiveness or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
2 0 the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 

2 5 high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 

skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 

3 0 the transplant. The administration of a molecule which inhibits or blocks interaction of 

a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in conjunction with a 
monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., B7- 
i R7.^ or blocking antibodv). prior to transplantation can lead to the binding of the 
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molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 
5 anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

10 The efficacy of particular blocking reagents in preventing organ transplant 

rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 

1 5 described in Lenschow et al, Science 257:789-792 (1992) and Turka et al, Proc. Natl. Acad. 
Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

2 0 Blocking antigen function may also be therapeutically useful for treating 

autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 

2 5 Administration of reagents which block costimulation of T cells by disrupting 

receptonligand interactions of B lymphocyte antigens can be used to inhibit T cell 
activation and prevent production of autoantibodies or T cell-derived cytokines which 
may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from 

3 0 the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 

disorders can be determined using a number of well-characterized animal models of 
human autoimmune diseases. Examples include murine experimental autoimmune 
encephalitis, systemic lupus erythmatosis in MRL/lprApr mice or NZB hybrid mice, 
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
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murine experimental myaWnia gravis (see Paul ed., Fundamentaffcunology, Raven 
Press, New York, 1989, pp. 840-856). 

Upregulation of an antigen function (preferably a B lymphocyte antigen function), 
as a means of up regulating immune responses, may also be useful in therapy. 
5 Upregulation of immune responses may be in the form of enhancing an existing immune 
response or eliciting an initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
1 0 lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 

. . . ^ * nonHHp nf the nresent invention or together with a 

pulsed /\rv_s wmti cApc^-t, - r-r -■ i 

stimulatory form of a soluble peptide of the present invention and reintroducing the in 
15 vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 
acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
2 0 costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 
(preferably B lymphocyte antigen function) may be useful in the induction of tumor 
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present 

2 5 invention can be administered to a subject to overcome tumor-specific tolerance in the 

subject. If desired, the tumor cell can be transfected to express a combination of peptides. 
For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 

3 0 transfected tumor cells are returned to the patient to result in expression of the peptides 

on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 
to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 

, ^ fho crf^rp of the tumor cell provides the necessary 

lympnocyie diuigcnw -- 
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costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 
molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II 
molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 



the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
10 immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 
15 immune response in a human subject may be sufficient to overcome tumor-specific 
tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 



2 0 limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 
3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. 
USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., 

2 5 J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., 

J. Immunol. 140:508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 
1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et aL, J. Immunol. 
135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., 

3 0 Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol 144:3028-3033, 1990; and Assays for B cell function: In vitro 



5 



cytoplasmic-domain truncated portion) of an MHC class I a chain protein and P 2 
microglobulin protein or an MHC class II a chain protein and an MHC class II p chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
Expression of the appropriate class I or class II MHC in conjunction with a peptide having 



Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
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antibody production, Mo"j. and Brunswick, M. In C„r«r,f PrSRs in Immunology. 
J E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
protems that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai 
et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 

, ... i^o.c/iq c^q 1QQ1- Maratrmia et al., journal of Immunology 
Experimental lvieuicmt: ^, 

154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995; 
Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965, 
19 94; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 
et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 
et al Cytometry 13:795-808, 1992; Gorczyca et al.. Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, 
Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; 
Gorczyca et al., International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al., Blood 
84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 
85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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T-Tpmato poiesis Relating Activity 

A protein of the present invention may be useful in regulation of hemopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 

, . :„ „„„™rf n f mlnnv fnrmine cells or of factor-dependent cell 
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lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor cells alone or in combination with other cytokines, 
thereby indicating utility, for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the production of erythroid 
5 precursors and /or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 

1 0 disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 

15 limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 
ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

2 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for proliferation and differentiation of various hematopoietic lines 
are cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 

2 5 proteins that influence embryonic differentiation hematopoiesis) include, without 

limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
aL, Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 

3 0 others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 

described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 
Hematopoietic Cells. R.I. Freshney, et al eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 
NY. 1994; Hirayama et aL, Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, LK. and 
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Briddell, R.A. In Culture W^atopotetk Cells. R.I. Freshney, ef nWs. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994; Neben et al., Experimental Hematology 22:353-359, 
1994- Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematoma 
Cells RI Freshney, et al. ed. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 163-179, 
Wiley-Liss Inc., New York, NY. 1994; Long term culture initiating cell assay, Sutherland, 
HJ. In Culture of Hematopcetic Cells. R.I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, 
Inc., New York, NY. 1994. 
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Tissue Growth Activity 

A protein of the present invention also may have utility in compositions used for 

ii„ am pnt and /or nerve tissue growth or regeneration, as well as 
Done, cai uicjgc, 

for wound healing and tissue repair and replacement, and in the treatment of burns, 

1 5 incisions and ulcers. 

A protein of the present invention, which induces cartilage and/or bone growth 
in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 

2 0 well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 

2 5 disease, and in other tooth repair processes. Such agents may provide an environment 

to attract bone-forming cells, stimulate growth of bone-forming cells or mduce 
differentiation of progenitors of bone-forming cells. A protein of the invention may also 
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and/or cartilage repair or by blocking inflammation or processes of tissue 

3 0 destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory 

processes. 

Another category of tissue regeneration activity that may be attributable to the 
protein of the present invention is tendon /ligament formation. A protein of the present 

. = -..u:„u ; n A„^ tPndon/heament-like tissue or other tissue formation in 
invention, wnn.li wi^*«v-^-- — w 
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circumstances wheresuch tissue is not normally formed, has^pplication in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and 
other animals. Such a preparation employing a tendon/ligament-like tissue inducing 



5 well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon /ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 

1 0 compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 

15 tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and /or sequestering agent as a carrier as is well known in 
the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
2 0 peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 

2 5 Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 

syndrome. Further conditions which may be treated in accordance with the present 
invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 

3 0 invention. 

Proteins of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and the like. 



protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
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It is expected that iffotein of the present invention may al^xhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) 
and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 
5 comprising such tissues. Part of the desired effects may be by inhibition or modulation 
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 

A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
1 0 and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
i.u~ ^^TarfV, ficcnpc rlpcirribed above. 

The activity of a protein of the invention may, among other means, be measured 

15 by the following methods: 

Assays for tissue generation activity include, without limitation, those described 

in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 

International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 

Publication No. WO91/07491 (skin, endothelium ). 
2 0 Assays for wound healing activity include, without limitation, those described in: 

Winter, Fpi^r^l Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.), Year 

Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 

Dermatol 71:382-84 (1978). 



2 5 Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 
and decrease spermatogenesis in male mammals. Administration of sufficient amounts 
of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
; „ a h^odimer or as a heterodimer with other protein subunits of the inhibin- 

11 L V V_l I VXWA », « ' - 
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p group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 
United States Patent 4,798,885. A protein of the invention may also be useful for 
advancement of the onset of fertility in sexually immature mammals, so as to increase the 
5 lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for activin /inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
10 al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al, Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 



Chemotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
15 (e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 

fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and /or endothelial cells. 

Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 

population to a desired site of action. Chemotactic or chemokinetic proteins provide 

particular advantages in treatment of wounds and other trauma to tissues, as well as in 
2 0 treatment of localized infections. For example, attraction of lymphocytes, monocytes or 

neutrophils to tumors or sites of infection may result in improved immune responses 

against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 

can stimulate, directly or indirectly, the directed orientation or movement of such cell 

2 5 population. Preferably, the protein or peptide has the ability to directly stimulate directed 

movement of cells. Whether a particular protein has chemotactic activity for a population 
of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 

3 0 by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration 
of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
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include, without limitation^se described in: Current Protocols immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 
Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 
beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
5 APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 

Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 

10 As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
and other hemostatic events in treating wounds resulting from trauma, surgery or other 

a f +v,o inmnHnn mav also be useful for dissolving or inhibiting 

formation of thromboses and for treatment and prevention of conditions resulting 

1 5 therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
2 0 described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 



Rece ptor /Lieand Activity 

25 A protein of the present invention may also demonstrate activity as receptors, 

receptor ligands or inhibitors or agonists of recepfor/ligand interactions. Examples of 
such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 

3 0 cellular adhesion molecules (such as selecting integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
^ iv.^a i.^^rHor, A nrntpin of the present invention (including, without 
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limitation, fragmenreof receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
5 Suitable assays for receptor-ligand activity include without limitation those 

described in:Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W.Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
10 Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al, Cell 80:661-670, 
1995. 



Anti-Inflammatory Activity 

15 Proteins of the present invention may also exhibit anti-inflammatory activity. The 

anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 

2 0 suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 
inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 

2 5 lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 
from over production of cytokines such as TNF or IL-1. Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 



3 0 Cadherin/ Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 
roles during development, particularly in defining specific cell types. Loss or alteration 
of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
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diseases, such as pemphiguWlgaris and pemphigus foliaceus (auWmune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 

The cadherin superfamily includes well over forty members, each with a distinct 
pattern of expression. All members of the superfamily have in common conserved 
5 extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
1 0 itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophils adhesion with other cadherins. 

E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 
types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
15 polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 
their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
2 0 suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 
encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

2 5 Cancer cells have also been shown to express cadherins of a different tissue type 

than their origin, thus allowing these cells to invadeand metastasize in a different tissue 
in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 

3 0 reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 
polynucleotides of the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
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forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 
cancer, the less cadherin expression there will be, and this decrease in cadherin expression 
can be detected by the use of a cadherin-binding antibody. 
5 Fragments of proteins of the present invention with cadherin activity, preferably 

a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 

1 0 invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 

15 Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
2 0 A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 
tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 

2 5 or inhibiting factors, agents or cell types which promote tumor growth. 

Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 

3 0 agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 

(suppressing or enhancing) bodily characteristics, including, without limitation, height, 
weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 
or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; 
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effecting the fertility of JrTor female subjects; effecting the meWlism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 
effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
5 cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
1 0 disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another 
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15 ATVMINISTP ATT ON AND POSING 

A protein of the present invention (from whatever source derived, including 
without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutical^ acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 
IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 
cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
3 0 pharmaceutical composition to produce a synergistic effect with protein of the invention, 
or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
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of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

A protein of the present invention may be active in multimers (e.g., heterodimers 
or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
5 compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 

The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 

1 0 lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 

1 5 supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

2 0 The pharmaceutical composition of the invention may be in the form of a liposome 

in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 

2 5 monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 

and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

3 0 As used herein, the term "therapeutically effective amount" means the total 

amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
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ingredient, administered aSe, the term refers to that ingredient alo^ When applied to 
a combination, the term refers to combined amounts of the active ingredients that result 
in the therapeutic effect, whether administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic 
factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 
Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 
an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 
physiological saline solution, dextrose or other saccharide solution, or glycols such as 
ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 
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of the present invention, and preferably from about 1 to 30% protein of the present 
invention. 

When a therapeutically effective amount of protein of the present invention is 
administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
5 invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 

1 0 Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringers Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 
The amount of protein of the present invention in the pharmaceutical composition 

15 of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 
will administer low doses of protein of the present invention and observe the patient's 

2 0 response. Larger doses of protein of the present invention may be administered until the 

optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 jag to about 100 
mg (preferably about O.lng to about 10 mg, more preferably about 0.1 ug to about 1 mg) 
25 of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 

3 0 will be in the range of 12 to 24 hours of continuous intravenous administration. 

Ultimately the attending physician will decide on the appropriate duration of intravenous 
therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
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antibodies may be obtaJRsing either the entire protein or fralUnts thereof as an 
immunogen. The pephde immunogens additionally may contain a cysteine residue at the 
carb oxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyarun 
(KLH) Methods for synthesizing such peptides are known in the art, for example, as in 
R P. Merrifield, J. Amer.Chem.Soc 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 
211 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
^ful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells neutralizing monoclonal antibodies against the protein may be useful in detecting 
and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 
administered, the therapeutic composition for use in this invention is, of course, m a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
) tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the 
5 composition would include a matrix capable of delivering the protein-containmg 
composition to the site of bone and/or cartilage damage, providing a structure for the 
developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presently in use for other implanted medical 
applications. 

, o The choice of matrix material is based on biocompatibiHty, biodegradability, 

mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 

„ . u^i,^o hvHmxvaoatite. polvlactic acid, polyglycolic acid and 

suitate, uiLdiLiuiiipn^Y ilw ' v ' *v — J 1 
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polyanhydrides. Other potential materials are biodegradable and biologically well- 



defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 
or extracellular matrix components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxapatite, bioglass, aluminates, or other 
5 ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium- 
aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

10 Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic 

acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions 
from disassociating from the matrix. 

15 A preferred family of sequestering agents is cellulosic materials such as 

alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 

2 0 acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 
wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 

2 5 are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 



3 0 (EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-a and 
TGF-P), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 



In further compositions, proteins of the invention may be combined with other 
agents beneficial to the treatment of the bone and /or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
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The dosage regimR a protein-containing pharmaceuticWomposiuon to be 
used in tissue regeneration will be determined by the attending physician considering 
various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the S1 ze of 
5 a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitute and with inclusion of other proteins m 
the pharmaceutical composition. For example, the addition of other known growth 
factors such as IGF I (insulin like growth factor I), to the final composition, may also effect 
10 the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and/or repair, for example, X-rays, histomorphometric determinations and tetracycline 
labeling. 

also be used for gene therapy, bucn 

X^OiynuCieouut:i> ui uic — 

polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
1 5 mammalian subject. Polynucleotides of the invention may also be administered by other 

known methods for introduction of nucleic acid into a cell or organism (including, without 

limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 

invention in order to proliferate or to produce a desired effect on or activity in such cells. 
2 0 Treated cells can then be introduced in vivo for therapeutic purposes. 

Patent and literature references cited herein are incorporated by reference as if 
fully set forth. 
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SEQUENCE LISTING 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John M . 
LaVallie, Edward R. 
Racie, Lisa A. 
Merberg, David 
Treacy, Maurice 
Spaulding, Vikki 
Agostino, Michael J. 

<ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 Cambridge Park Drive 

(C) CITY : Cambridge 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASSIFICATION : 

(viii) ATTORNEY / AG ENT INFORMATION: 

(A) NAME: Sprunger, Suzanne A. 

(B) REGISTRATION NUMBER: 41,323 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-82 84 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1433 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPETT cDNA 



it 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GCCCGTGGTT ACACAGCTAA TAGGTGGTGG AGATGGAGAC AG AATT C AAA CCCAGGCATT 
CTTGATCTAC AGTATACACT CTTACCCACC ATCCTACACA GCCTTTCTTA TTCATAAAAT 
ATTTTCTACA GTGCAAGAAA ATTTTGATAG CTTGCTTATT TATTCAAGAT TTAGACTATA 
TAG ATTAAC T AGACTATCAA GATTTTAAAT TCTTGTGTTT TTTGTTTTTY YCCCCCTCTG 
TGGCATAACT ATCTCTTAGT GATTTGAAGT TCTGATAGGC ATTTATTTAT GTTTTTGATT 
AATTAAAAAA AGGGAAAAAA ATGGAACATA AT TATTGAAG CTATCGTCTA GGTAAAAACC 
TTTCTAAATG TAAGGTTCAT TTAGATTGAT GACC TGTAG A GTGTAACAGT ATTGC CAT AG 
GC ATACAGC T TTTTAATCAC ATATCATACA TAAACAAATT AGTAATACAG GTGGGTAGAT 
ACAGACCCTA ACTTTGAGCT CTAAGATGAA ATTTGTTTAT AAATCCCTAG TTTCCATTCA 
GTTTTTTCAA TATTTATCAA ACACCTACTG TGCCAGGCAT TGTTTAGGCA CAGGGGATAC 
AGCAGGAGAA CAAAATGAAC AAAATTTTTT GCCTTCACAG AGCTAATTTT TTGTATTTTT 
TTGTAGAGAT GGGGTTTTGC CATGTTTGCC AGTCTGGTCT CAACCTCCTA AGCTCAAGCA 
GCCCACCCTC CTTGGCTTCC CAAAGTGCTG AGATTACAGG CATGAGCCAC CGCACTCTTC 
TTAGCTATTT TTCATAGAAA CTTTATGTAT AAAAATAGAA GGGTAATGAC ACACCACCTT 
TCTACTGATC TCCCCACTTC AGTAGTTATC ACATAACAGT CTTTTTTCAC CTATCTCCTT 
CACTTTAC CT CCTCTCCCTT AGTACTTTGA AGTAAATCTC AATGCAAGCT GGTATGTTTT 
TCAAAATGAA ACATATAAAC ATGGACTAGA AAAAAATCTC TTCATACAGG ATTTGGTTTT 
GCAGAGAATT TACAAAGTGC GGTTAATGTA TGCCAATGGT TTCTCAGTTT GGATATCGAG 
ATCCTTAGAT GGACCATGAA GCTGGTAATA ATTTTATAGC TAACTTTTGT TAAGTGCTTA 
CTATATGCCA GGCACTGTTC TAAGCATTTT ACGTGTATTC ATTCATTCAG TTCTCACAAC 
TCTTTTAATT AGGTATTATT ATGATCTCCA TCTCAAAACA AAACAAAACA AAAAAATTAG 
CCTGGCATGG TGGCAGGCGC CTGTAATCCC AGTTACTTGA GAGGCTAAGG CAGGAGAATC 
GCTTGAATCT GGGAGGCAGA GGTTGCAGTG AGCCGAGATT GCACTACTGC ACTCCAGCCT 
GGGTGACAGA ATGAGACTCT GTCTCAAAAA AAAAAAAAAA AAAAAAAAAA AAA 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Phe Val Tyr Lys Ser Leu Val Ser lie Gin Phe Phe Gin Tyr 
15 10 15 

Leu Ser Asn Thr Tyr Cys Ala Arg His Cys Leu Gly Thr Gly Asp Thr 
20 25 30 

Ala Gly Glu Gin Asn Glu Gin Asn Phe Leu Pro Ser Gin Ser 
35 40 45 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TCGGGACAGA TTTAAGTGCA GCGTGGATTT TTTTTTTCTC ACTTTGCCTT GTGTTTTCCA 60 

CCCTGAAAGA ATGTTGTGGC TGCTCTTTTT TC TGGTG ACT GCCATTCATG CTGAACTCTG 12 0 

TCAACCAGGT GCAGAAAATG CTTTTAAAGT GAG AC TT AG T ATCAGAACAG CTCTGGGAGA 180 

TAAAGCATAT GCCTGGGATA CCAATGAAGA ATACCTCTTC AAAGCGATGG TAGCTTTCTC 240 

CATGAGAAAA GTTCCCAACA GAGAAGCAAC AGAAATTTCC CATGTCCTAC TTTGCAATGT 3 00 

AACCCAGAGG GTATCATTCT GGTTTGTGGT T AC AG AC C C T TCAAAAAATC ACACCCTTCC 3 60 

TGCTGTTGAG GTGCAATCAG CCATAAGAAT GAACAAGAAC CGGATCAACA ATGCCTTCTT 420 

TGTAAATGAC CAAACTCTGG AATTTTTAAA AATCCCTTCC ACACTTGCAC CACCCATGGA 480 
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CCCATCTGTG CCCATCTGGA TO^TATATT TGGTGTGATA TTTTGCATCA tWaGTTGC 540 

AATTGCACTA CTGATTTTAT CAGGGATCTG GCAACGTAGA AGAAAGAACA AAGAACCATC 60 0 

TGAAGTGGAT GACGCTGAAG ATAAGTGTGA AAACATGATC ACAATTGAAA ATGGCATCCC 66 0 

CTCTGATCCC CTGGACATGA AGGGAGGGCA TATTAATGAT GCCTTCATGA CAGAGGATGA 72 0 

GAGGCTCACC CCTCTCTGAA GGGCTGTTGT TCTGCTTCCT CAAGAAATTA AACATTTGTT 7 80 

TCTGTGTGAG TGCTGAGCAT CCTGAAATAC CAAGAGCAGA TCATATATTT TGTTTCACCA 8 40 

TTCTTCTTTT GTAATAAATT TTGAATGTGC TTGAAAGTGA AAAGCAATCA ATT AT AC CCA 90 0 

CCAACACCAC TGAAATCATA AGCTATTCAC GACTCAAAAT ATTCTAAAAT ATTTTTCTGA 9 60 

CAGTATAGTG TATAAATGTG GTCATGTGGT ATTTGTAGTT ATTGATTTAA GCATTTTTAG 102 0 

AAATAAGATC AGGCATATGT ATATATTTTC ACACTTCAAA GACCTAAGGA AAAATAAATT 108 0 

TTCCAC-TC-GA GAATACATAT AATATGGTGT AGAAATCATT GAAAATGGAT CCTTTTTGAC 1140 

GATCACTTAT ATCACTCTGT ATATGACTAA GTAAACAAAA GTGAGAAGTA ATTATTGTAA 12 00 

ATGGATGGAT AAAAATGGAA TTACTCATAT ACAGGGTGGA ATTTTATCCT GTTATCACAC 12 6 0 

CAACAGTTGA TTATATATTT TCTGAATATC AGCCCCTAAT AGGACAATTC TATTTGTTGA 13 2 0 

CCATTTCTAC AATTTGTAAA AGTCCAATCT GTGCTAACTT AATAAAGTAA TAATCATCTC 13 80 

TTTTAAAAAA AAAAAAAAAA A 14 01 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Leu Trp Leu Leu Phe Phe Leu Val Thr Ala lie His Ala Glu Leu 
1 5 10 15 

Cys Gin Pro Gly Ala Glu Asn Ala Phe Lys Val Arg Leu Ser lie Arg 
20 25 30 

Thr Ala Leu Gly Asp Lys Ala Tyr Ala Trp Asp Thr Asn Glu Glu Tyr 
35 40 45 
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Leu Phe Lys Ala Met Val Ala Phe Ser Met Arg Lys Val Pro Asn Arg 

50 55 60 

Glu Ala Thr Glu lie Ser His Val Leu Leu Cys Asn Val Thr Gin Arg 
65 70 75 80 

Val Ser Phe Trp Phe Val Val Thr Asp Pro Ser Lys Asn His Thr Leu 
85 90 95 

Pro Ala Val Glu Val Gin Ser Ala lie Arg Met Asn Lys Asn Arg lie 
100 105 110 

Asn Asn Ala Phe Phe Val Asn Asp Gin Thr Leu Glu Phe Leu Lys lie 
115 120 125 

Pro Ser Thr Leu Ala Pro Pro Met Asp Pro Ser Val Pro lie Trp lie 
130 135 140 

lie lie Phe Gly Val lie Phe Cys lie lie lie Val Ala lie Ala Leu 
145 150 155 160 

Leu lie Leu Ser Gly lie Trp Gin Arg Arg Arg Lys Asn Lys Glu Pro 
165 170 175 

Ser Glu Val Asp Asp Ala Glu Asp Lys Cys Glu Asn Met lie Thr lie 
180 185 190 

Glu Asn Gly He Pro Ser Asp Pro Leu Asp Met Lys Gly Gly His lie 
195 200 205 

Asn Asp Ala Phe Met Thr Glu Asp Glu Arg Leu Thr Pro Leu 
210 215 220 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCGGCCGCAG GTC TAG AATT CAATCGGCCA CAAGCTACTC TTTGGAGCCC ATCTATGGTT 60 

TGTGGTATGA CCACTCCTCC AACTTCTCCT GGAAATGTCC CACCTGATCT GTCACACCCT 12 0 

TACAGTAAAG TCTTTGGTAC AACTGCAGGT GGAAAAGGAA CTCCTCTGGG AACCCCAGCA 18 0 

ACCTCTCCTC CTCCAGCCCC ACTCTGTCAT TCGGATGACT ACGTGCACAT TTCACTCCCC 2 40 
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CAGGCCACAG TCACACCCCC^^GGAAGGAA GAGAGAATGG ATTCTGCAAG aPcATGTCTA 3 00 

CACAGACAAC ACCATCTTCT GAATGACAGA GGATCAGAAG AGCCACCTGG CAGCAAAGGT 3 60 

TCTGTCACTC TAAGTGATCT TCCAGGGTTT TTAGGTGATC TGGCCTCTGA AGAAGATAGT 420 

ATTGAAAAAA AAAAAAAAAA A 441 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Val Cys Gly Met Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro 
1 5 10 15 

Pro Asp Leu Ser His Pro Tyr Ser Lys Val Phe Gly Thr Thr Ala Gly 
20 25 30 

Gly Lys Gly Thr Pro Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala 
35 40 45 

Pro Leu Cys His Ser Asp Asp Tyr Val His lie Ser Leu Pro Gin Ala 
50 55 60 

Thr Val Thr Pro Pro Arg Lys Glu Glu Arg Met Asp Ser Ala Arg Pro 
65 70 75 80 

Cys Leu His Arg Gin His His Leu Leu Asn Asp Arg Gly Ser Glu Glu 
85 90 95 

Pro Pro Gly Ser Lys Gly Ser Val Thr Leu Ser Asp Leu Pro Gly Phe 
100 105 HO 

Leu Gly Asp Leu Ala Ser Glu Glu Asp Ser lie 
115 120 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2353 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



67 



WO 98/32853 

(ii) MOLECULE TYPE : 



cDNA 



PCT7US98/01396 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AAGAAGGCGA TGTCACTATT GGAGAAGATG CACCAAATCT TTCTTTTAGC ACCAGTGTGG 6 0 

GAAATGAGGA CGCCAGGACA GCCTGGCCCG AATTACAACA GAGCCATGCT GTTAATCAGC 12 0 

TCAAAGATTT GTTGCGCCAA CAAGCAGATA AGGAAAGTGA AGTATCTCCG TCAAGAAGAA 180 

GAAAAATGTC CCCCTTGAGG TCATTAGAAC ATGAGGAAAC CAATATGCCT ACTATGCACG 2 40 

ACCTTGTTCA TACTATTAAT GACCAGTCTC AATATATTCA TCATTTAGAG GCAGAAGTTA 3 00 

AGTTCTGCAA GGAGGAACTC TCTGGAATGA AAAATAAAAT ACAAGTAGTT GTGCTTGAAA 3 60 

ACGAAGGGCT CCAGCAACAG CTAAAATCTC AAAGACAAGA GGAGAC AC TG AGGGAACAAA 42 0 

CACTTCTGGA TGCATCCGGA AACATGCACA ATTCTTGGAT TACAACAGGT GAAGATTCTG 4 80 

GGGTGGGCGA AACCTCCAAA AGACCATTTT CCCATGACAA TGCAGATTTT GGCAAAGCTG 540 

CATCTGCTGG TGAGCAGCTA GAACTGGAGA AGCTAAAACT TACTTATGAG GAAAAGTGTG 60 0 

AAATTGAGGA ATCCCAATTG AAGTTTTTGA GGAACGACTT AGCTGAATAT CAGAGAACTT 66 0 

GTGAAGATCT TAAAGAGCAA C T AAAGC AT A AAGAATTTCT TCTGGCTGCT AATACTTGTA 72 0 

ACCGTGTTGG TGGTCTTTGT TTGAAATGTG CTCAGCATGA AGCTGTTCTT TCCCAAACCC 7 80 

ATACTAATGT TCATATGCAG ACCATCGAAA GACTGGTTAA AGAAAGAGAT GACTTGATGT 84 0 

CTGCACTAGT TTCCGTAAGG AGCAGCTTGG CAGATACGCA GCAAAGAGAA GCAAGTGCTT 90 0 

ATGAACAGGT GAAACAAGTT TTGCAAATAT CTGAGGAAGC CAATTTTGAA AAAACCAAGG 96 0 

CTTTAATCCA GTGTGACCAG TTGAGGAAGG AGCTGGAGAG GCAGGCGGAG CGACTTGAAA 102 0 

AAGAACTTGC ATCTCAGCAA GAGAAAAGGG CCATTGAGAA AGACATGATG AAAAAGGAAA 108 0 

TAACGAAAGA AAGGG AG T AC ATGGGATCAA AGATGTTGAT CTTGTCTCAG AATATTGCCC 114 0 

AACTGGAGGC CCAGGTGGAA AAGGTTACAA AGGAAAAGAT TTCAGCTATT AATCAACTGG 12 00 

AGGAAATTCA AAGCCAGCTG GCTTCTCGGG AAATGGATGT CACAAAGGTG TGTGGAGAAA 12 6 0 

TGCGCTATCA GCTGAATAAA ACCAACATGG AGAAGGATGA GGCAGAAAAG GAGCACAGAG 13 2 0 

AGTTCAGAGC AAAAACTAAC AGGGATCTTG AAATTAAAGA TCAGGAAATA GAGAAATTGA 13 80 

GAATAGAACT GGATGAAAGC AAACAACACT TGGAACAGGA GCAGCAGAAG GCAGCCCTGG 1440 
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CCAGAGAGGA GTGCCTGAGA^KaCAGAAC TGCTGGGCGA ATCTGAGCAC ^tTGCACC 15 00 

TC AC C AG AC A GGAAAAAGAT AGCATTCAGC AGAGCTTTAG CAAGGAAGCA AAGGCCCAAG 15 6 0 

CCCTTCAGGC CCAGCAAAGA GAGCAGGAGC TGACACAGAA GATACAGCAA ATGGAAGCCC 162 0 

AGCATGACAA AACTGAAAAT GAACAGTATT TGTTGCTGAC CTCCCAGAAT ACATTTTTGA 1680 

CAAAGTTAAA GGAAGAATGC TGTACATTAG CCAAGAAACT GGAACAAATC TCTCAAAAAA 17 40 

CCAGATCTGA AATAGCTCAA CTCAGTCAAG AAAAAAGGTA TACATATGAT AAATTGGGAA 180 0 

AGTTACAGAG AAGAAATGAA GAATTGGAGG AACAGTGTGT CCAGCATGGG AGAGTACATG 18 6 0 

AGACGATGAA GCAAAGGCTA AGGCAGCTGG ATAAGCACAG CCAGGCCACA GCCCAGCAGC 192 0 

TGGTGCAGCT CCTCAGCAAG CAGAACCAGC TTCTCCTGGA GAGGCAGAGC CTGTCGGAAG 19 8 0 

AGGTGGACCG GCTGCGGACC CAGTTACCCA GCATGCCACA ATCTGATTGC TGACCTGGAT 2040 

GGAACAGART GAAATAAATG ATTTACAAAG AGATATTTAC ATTCATCTGG TTTAGACTTA 210 0 

ATATGCCACA ACGCACCACG ACCTTCCCAG GGTGACACCG CCTCAGCCTG CAGTGGGGCT 2160 

GGTCCTCATC AACGCGGGCG CTGTCCCCGC ACGCAGTCGG GCTGGAGCTG GAGTCTGACT 22 2 0 

CTAGCTGAGC AGAGCTCCTG GTGTATGTTT TCAGAAATGG CTTGAAGTTA TGTGTTTAAA 2280 

TCTGCTCATT CGTATGCTAG GTTATACATA TGATTTTCAA TAAATGAACT TTTTAAAGAA 2 3 40 

2353 

AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Ser Pro Leu Arg Ser Leu Glu His Glu Glu Thr Asn Met Pro Thr 
1 5 10 15 

Met His Asp Leu Val His Thr He Asn Asp Gin Ser Gin Tyr He His 
20 25 30 

His Leu Glu Ala Glu Val Lys Phe Cys Lys Glu Glu Leu Ser Gly Met 
35 40 45 
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Lys Asn Lys He Gin Val Val Val Leu Glu Asn Glu Gly Leu Gin Gin 

50 55 60 

Gin Leu Lys Ser Gin Arg Gin Glu Glu Thr Leu Arg Glu Gin Thr Leu 
65 70 75 80 

Leu Asp Ala Ser Gly Asn Met His Asn Ser Trp He Thr Thr Gly Glu 
85 90 95 

Asp Ser Gly Val Gly Glu Thr Ser Lys Arg Pro Phe Ser His Asp Asn 
100 105 110 

Ala Asp Phe Gly Lys Ala Ala Ser Ala Gly Glu Gin Leu Glu Leu Glu 
115 120 125 

Lys Leu Lys Leu Thr Tyr Glu Glu Lys Cys Glu He Glu Glu Ser Gin 
130 135 140 

Leu Lys Phe Leu Arg Asn Asp Leu Ala Glu Tyr Gin Arg Thr Cys Glu 
145 150 155 160 

Asp Leu Lys Glu Gin Leu Lys His Lys Glu Phe Leu Leu Ala Ala Asn 
165 170 175 

Thr Cys Asn Arg Val Gly Gly Leu Cys Leu Lys Cys Ala Gin His Glu 
180 185 190 

Ala Val Leu Ser Gin Thr His Thr Asn Val His Met Gin Thr He Glu 
195 200 205 

Arg Leu Val Lys Glu Arg Asp Asp Leu Met Ser Ala Leu Val Ser Val 
210 215 220 

Arg Ser Ser Leu Ala Asp Thr Gin Gin Arg Glu Ala Ser Ala Tyr Glu 
225 230 235 240 

Gin Val Lys Gin Val Leu Gin He Ser Glu Glu Ala Asn Phe Glu Lys 
245 250 255 

Thr Lys Ala Leu He Gin Cys Asp Gin Leu Arg Lys Glu Leu Glu Arg 
260 265 270 

Gin Ala Glu Arg Leu Glu Lys Glu Leu Ala Ser Gin Gin Glu Lys Arg 
275 280 285 

Ala He Glu Lys Asp Met Met Lys Lys Glu He Thr Lys Glu Arg Glu 
290 295 300 

Tyr Met Gly Ser Lys Met Leu He Leu Ser Gin Asn He Ala Gin Leu 
305 310 315 320 

Glu Ala Gin Val Glu Lys Val Thr Lys Glu Lys He Ser Ala He Asn 
325 330 335 

Gin Leu Glu Glu He Gin Ser Gin Leu Ala Ser Arg Glu Met Asp Val 
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340 



345 350 



Thr Lys Val Cys Gly Glu Met Arg Tyr Gin Leu Asn Lys Thr Asn Met 
355 360 365 

Glu Lys Asp Glu Ala Glu Lys Glu His Arg Glu Phe Arg Ala Lys Thr 
370 375 380 

Asn Arg Asp Leu Glu lie Lys Asp Gin Glu He Glu Lys Leu Arg lie 



385 



390 395 400 



Glu Leu Asp Glu Ser Lys Gin His Leu Glu Gin Glu Gin Gin Lys Ala 
405 410 415 

Ala Leu Ala Arg Glu Glu Cys Leu Arg Leu Thr Glu Leu Leu Gly Glu 
420 425 430 

Ser Glu His Gin Leu His Leu Thr Arg Gin Glu Lys Asp Ser He Gin 
435 440 445 

sr Lys Glu Ala Lys Ala Gin Ala Leu Gin Ala Gin Gin 
450 455 460 

Ara Glu Gin Glu Leu Thr Gin Lys He Gin Gin Met Glu Ala Gin His 
465 470 475 480 

Asp Lys Thr Glu Asn Glu Gin Tyr Leu Leu Leu Thr Ser Gin Asn Thr 
485 490 495 

Phe Leu Thr Lys Leu Lys Glu Glu Cys Cys Thr Leu Ala Lys Lys Leu 
500 505 510 

Glu Gin He Ser Gin Lys Thr Arg Ser Glu He Ala Gin Leu Ser Gin 
515 520 525 

Glu Lys Arg Tyr Thr Tyr Asp Lys Leu Gly Lys Leu Gin Arg Arg Asn 
530 535 540 

Glu Glu Leu Glu Glu Gin Cys Val Gin His Gly Arg Val His Glu Thr 

ccn S 5 5 560 

545 550 33:5 

Met Lys Gin Arg Leu Arg Gin Leu Asp- Lys His Ser Gin Ala Thr Ala 
565 570 575 

Gin Gin Leu Val Gin Leu Leu Ser Lys Gin Asn Gin Leu Leu Leu Glu 
580 585 590 

Arg Gin Ser Leu Ser Glu Glu Val Asp Arg Leu Arg Thr Gin Leu Pro 
595 600 605 

Ser Met Pro Gin Ser Asp Cys 
610 615 

(2) INFORMATION FOR SEQ ID NO : 9 : 
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(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GCGACCTCTT CTGCGGCCGG CCTGGGCAGG TGTCTTCCTC GAGAGGCAGG CAGGGGATCC 60 

CGGAC AC TAG CTTTATCGTC ATCTGGGAAA TTGTTAAAAA TGCAAATTCG CAAGTTTGAG 12 0 

AGCCATGGTT CCAAGAAACT GCATAAGCAT ACGAAATAAG TTGCAGCCTC CCGACTTATA 180 

CCCTGGTACT TCTAGTCTAA AACAGGATTT GACTCTACTA ATCCAGCCTT ATACAGGATG 240 

CTGTGTTCTT TGCTCCTTTG TGAATGTCTG TTGCTGGTAG CTGGTTATGC TCATGATGAT 3 00 

GACTGGATTG ACC 313 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCTTGGATGA TGCATTAAGT GATATTTTAA TTAATTTTAA GTTTCATGAT TTTGAAACAT 6 0 

GGAAGTGGCG ATTCGAAGAT TC CTTTGG AG TGGATC C ATA TAATGTGTTA ATGGTAATTC 12 0 

TTTGTCTGCT CTGCATCGTG GTTTTAGTGG CTACTGAGCT GTGGACATAT GTATGTTGGT 180 

ACACTCAGTT GAGACGTGTT TTAATCATCA GCTTTCTGTT C AG TTTGGG A TGGAATTGGA 24 0 

TGTATTTATA TAAGCTAGCT TTTGCACAGC ATCAGGCTGA AGTCGCCAAG ATGGAGCCAT 3 00 

TAAACAATGT GTGTGCCAAA AAGATGGACT GGACTGGAAG TATCTGGGAA TGGTTTAGAA 3 60 

GTTCATGGAC CTATAAGGAT GACCCATGCC AAAAAT AC T A TGAGCTCTTA CTAGTCAACC 42 0 

CTATTTGGTT GGTCCCACCA ACAAAGGCAC TTGCAGTTAC ATTCACCACA TTTGTAACGG 4 80 
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660 
677 



AGCCATTGAA GCATATTGGA ISaGGAACTG GGGAATTTAT TAAAGCACTC ATCAAGGAAA 54 0 

TTCCAGCGCT GCTTCATCTT CCAGTGCTGA TAATTATGGC ATTAGCCATC CTGAGTTTCT 6 00 

GCTATGGTGC TGGAAAATCA GTTCATGTGC TGAGACATAT AGGCGGTCCT GAGAGCGAAC 
CTCCCCAGGC ACTTCGG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Met Val He Leu Cys Leu Leu Cys He Val Val Leu Val Ala Thr Glu 
1 5 10 15 

Leu Trp Thr Tyr Val Cys Trp Tyr Thr Gin Leu Arg Arg Val Leu He 
20 25 30 

He Ser Phe Leu Phe Ser Leu Gly Trp Asn Trp Met Tyr Leu Tyr Lys 
35 40 45 

Leu Ala Phe Ala Gin His Gin Ala Glu Val Ala Lys Met Glu Pro Leu 
50 55 60 

Asn Asn Val Cys Ala Lys Lys Met Asp Trp Thr Gly Ser He Trp Glu 
65 70 75 80 

Trp Phe Arg Ser Ser Trp Thr Tyr Lys Asp Asp Pro Cys Gin Lys Tyr 
85 90 95 

Tyr Glu Leu Leu Leu Val Asn Pro He Trp Leu Val Pro Pro Thr Lys 
100 105 HO 

Ala Leu Ala Val Thr Phe Thr Thr Phe Val Thr Glu Pro Leu Lys His 
115 120 125 

He Gly Lys Gly Thr Gly Glu Phe He Lys Ala Leu Met Lys Glu He 
130 135 140 

Pro Ala Leu Leu His Leu Pro Val Leu He He Met Ala Leu Ala He 
145 150 155 160 

Leu Ser Phe Cys Tyr Gly Ala Gly Lys Ser Val His Val Leu Arg His 
165 170 175 
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lie Gly Gly Pro Glu Ser Glu Pro Pro Gin Al 
180 185 



eu Arg 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AGACGGCAGG AGGAATTGAT TATAGACCTG ATGGTGGAGC AGGTGATGCC GATTTCCATT 60 

ATAGGGGCCA AATGGGCCCC ATTGAGCAAG GCCCTTATGC CAAAATGTAT GAGGGTAGAA 120 

GAGAGATTTT GAGAGAGAGA GATGTTGACT TGAGATTTCA GGCTGGTCTC GAACTCCTGA 180 

CCTCAAGTGA CCCGCCCTTG TCGGCCTCCC AAAGTGCTGG GATTACAGGC ATGAGCCATT 24 0 

GTGCCCAGCC TATATAGTGT GAAGCTTTTA GGAAAATCAG AACAGGGTAG ACAGTTGTTA 3 00 

AAAACAATGT TTAAATGGAA TAATGTTGAA TGTTTACAGG CTGTAAGAAT TATTGTATAC 3 60 

ACAAAATAAT ACACAAAGTT TGTACTTTGT GTACAAATAC AAATTTGTAC TTTGTGTACA 42 0 

AATAATACAA AAAGTTTGTA TACACAAAAA AAAAAAAAAA AAAAAAAAAA 47 0 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTGGAGTCCA CGCGGATTTT CGAAGCTGGG GCTGGCAAGA GGCCGCTGGA CACCACGCTC 6 0 

CAGTCGTCAG CCCACTTCCT AGCTGAACAG CGCGAGGCGG CGGCAGCGAG CCGGGTCCCA 120 
CCATGGCCGC GAATTATTCC AGTACCAGTA CCCGGAGAGA ACATGTCAAA GTTAAAACCA 180 
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GCTCCCAGCC AGGCTTCCTG GAACGGCTGA GCGAGACCTC GGGTGGGATG TTTGTGGGGC 240 

TCATGGCCTT CCTGCTCTCC TTCTACCTAA TTTTCACCAA TGAGGGCCGC GCATTGAAGA 3 00 

CGGCAACCTC ATTGGCTGAG GGGCTCTCGC TTGTGGTGTC TCCTGACAGC ATCCACAGTG 3 60 

TGGCTCCGGA GAATGAAGGA AGGCTGGTGC ACATCATTGG CGCCTTACGG ACATCCAAGC 42 0 

TTTTGTCTGA TCCAAACTAT GGGGTCCATC TTCCGGCTGT GAAACTGCGG AGGCACGTGG 4 80 

AGATGTACCA ATGGGTAGAA ACTGAGGAGT CCAGGGAGTA CACCGAGGAT GGGCAGGTGA 540 

AGAAGGAGAC GAGGTATTCC TACAACACTG AATGGAGGTC AGAAATCATC AACAGCAAAA 600 

ACTTCGACCG AGAGATTGGC CACAAAAACC CCAGCTTCCT CTCTCCCACA GTGCCATGGC 6 60 

AGTGGAGTCA TTCATGGCAA CAGCCCCCTT TGTCCAAATT GGCAGGTTTT TCCTCTCGTC 72 0 

AGGCCTCATC GACAAAGTCG ACAACTTCAA GTCCCTGAGC CTATCCAAGC TGGAGGACCC 7 80 

,^ mr^mmnnr-n nTnn anarTT TTTCTACCAC AGCGAAAATC CCAAGTATCC 840 

nv- r-v j. i. — - - - 



AGAGGTGGGA GACTTGCGTG TCTCCTTTTC CTATGCTGGA CTGAGCGGCG ATGACCCTGA 900 

CCTGGGCCCA GCTCACGTGG TAACCTGGCT TCCCAGGGGC AGACACTAAG TCAGAGCCTC 960 

ACGACTTTCC TGGACACAGA CAC CTTGGTC AATGTCAGGA GCGC TTGGAC CCCCTTTTCC 1020 

CTGGGGAAAG GCACACTCTC GCACACACTC TCAGCCAGGC ACGCTTCTGA GCAGTTTCAG 10 80 

AGCTCCCATG TCCCCACAGC CATCCATGGA CCCCACGTTA AGAAGGGCAG CTCAAAAGGG 1140 

GTCTCATAGT CGCACCTTAT GACAGGTGTT CCAGTCACAC ACAGACCCTC TCCCCAAGCC 12 00 

CGTTTTGATC TGTCAATAAT TGGTCTTGCG TTCCTGGCCT ATGTGCAGTC CTGCCCCATC 1260 

CCCTGCTCTG CGCACTGCCC AAGAGCTTTG AATGCCTGGA GCTTTGAATG GAGCAGCTCA 13 20 

GCCAGAGCTG CAGAGGTGGA TGCATCCCAG ATGGATGTAT AGAGAGAGAA GCCCCAGGGT 13 80 

CTCTGTGCTC ACTTCCCCAG CCGGCACCCA GTCCCGGGAG GGTGGGCCAT GGCTCTCATG 1440 

GGCGTGTCTC CCGCTGGTCA CCCCTCAGCT CTAACACCAG GTCCTCTGAC CAGGTCACTG 1500 

TGATTGCCCG GCAGCGGGGT GACCAGCTAG TCCCATTCTC CACCAAGTCT GGGGATACCT 15 60 

TACTGCTCCT GCACCACGGG GACTTCTCAG CAGAGGAGGT GTTTCATAGA GAACTAAGGA 1620 

GCAACTCCAT GAAGACCTGG GGCCTGCGGG CAGCTGGCTG GATGGCCATG TTCATGGGCC 168 0 

TCAACCTTAT GACACGGATC CTCTACACCT TGGTGGACTG GTTTCCTGTT TTCCGAGACC 17 40 

TGGTCAACAT TGGCCTGAAA GCCTTTGCCT TCTGTGTGGC CACCTCGCTG AC CC TGCTGA 1800 

CCGTGGCGGC TGGCTGGCTC TTCTACCGAC CCCTGTGGGC CCTCCTCATT GCCGGCCTGG 1860 
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CCCTTGTGCC CATCCTTGTT GCTCGGACAC GGGTGCCAGC CAAAAAGTTG GAGTGAAAAG 192 0 

ACCCTGGCAC CCGCCCGACA CCTGCGTGAG CCCTAGGATC CAGGTCCTCT CTCACCTCTG 19 8 0 

ACCCAGCTCC ATGCCAGAGC AGGAGCCCCG GTCAATTTTG GACTCTGCAC CCCCTCTCCT 2 04 0 

CTTCAGGGGC CAGACTTGGC AGCATGTGCA CCAGGTTGGT GTTCACCAGC TCATGTCTTC 210 0 

CCCACATCTC TTCTTGC C AG TAAGCAGCTT TGGTGGGCAG CAGCAGCTCA TGAATGGCAA 216 0 

GCTGACAGCT TCTCCTGCTG TTTCCTTCCT CTCTTGGACT GAGTGGGTAC GGCCAGCCAC 2 22 0 

TCAGCCCATT GGCAGCTGAC AACGCAGACA CGCTCTACGG AGGCCTGCTG ATAAAGGGCT 2 2 80 

CAGCCTTGCC GTGTGCTGCT TCTCATCACT GCACACAAGT GCCATGCTTT GCCACCACCA 2 3 40 

CCAAGCACAT CTGTGATCCT GAAGGGCGGC CGTTAGTCAT TACTGCTGAG TCCTGGGTCA 2400 

CCAGCAGACA CACTGGGCAT GGACCCCTCA AAGCAGGCAC ACCCAAAACA CAAGTCTGTG 2 46 0 

GCTAGAACCT GATGTGGTGT T T AAAAG AG A AGAAACACTG AAGATGTCCT GAGGAGAAAA 2 52 0 

GC TGG AC AT A TACTGGGCTT C AC AC TTATC TTATGGCTTG GCAGAATCTT TGTAGTGTGT 2 58 0 

GGGATCTCTG AAGGCCCTAT TTAAGTTTTT CTTCGTTACT TTGCTGCTTC ATGTGTACTT 2 64 0 

TCCTACCCCA AGAGGAAGTT TTCTGAAATA AGATTTAAAA ACAAAACAAA AAAAAAAAAA 27 0 0 

AA 2702 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ala Ala Asn Tyr Ser Ser Thr Ser Thr Arg Arg Glu His Val Lys 
15 10 15 

Val Lys Thr Ser Ser Gin Pro Gly Phe Leu Glu Arg Leu Ser Glu Thr 



20 



25 



30 



Ser Gly Gly Met Phe Val Gly Leu Met Ala Phe Leu Leu Ser Phe Tyr 
35 40 45 



Leu lie Phe Thr Asn Glu Gly Arg Ala Leu Lys Thr Ala Thr Ser Leu 



76 



BNSDOCID: <WO 9832853A2 I > 



WO 98/32853 - PCT/US98/01396 



50 



55 60 



Ala Glu Gly Leu Ser Leu Val Val Ser Pro Asp Ser lie His Ser Val 
65 



70 75 80 



Ala Pro Glu Asn Glu Gly Arg Leu Val His He He Gly Ala Leu Arg 
85 90 95 

Thr Ser Lys Leu Leu Ser Asp Pro Asn Tyr Gly Val His Leu Pro Ala 
100 105 HO 

Val Lys Leu Arg Arg His Val Glu Met Tyr Gin Trp Val Glu Thr Glu 
115 120 125 

Glu Ser Arg Glu Tyr Thr Glu Asp Gly Gin Val Lys Lys Glu Thr Arg 
130 135 140 

Tyr Ser Tyr Asn Thr Glu Trp Arg Ser Glu He He Asn Ser Lys Asn 
145 150 155 160 

PhG Acp Arg Glu He Gly His Lys Asn Pro Ser Phe Leu Ser Pro Thr 
165 " 170 175 

Val Pro Trp Gin Trp Ser His Ser Trp Gin Gin Pro Pro Leu Ser Lys 
180 185 190 

Leu Ala Gly Phe Ser Ser Arg Gin Ala Ser Ser Thr Lys Ser Thr Thr 
195 200 205 



Ser Ser Pro 
210 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3395 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATCTTCCTGC CCTTCACCTG CATTGGCTAC ACGGCCACCA ATCAGGACTT CATCCAGCGC 
CTGAGCACAC TGATCCGGCA GGCCATCGAG CGGCAGCTGC CTGCCTGGAT CGAGGCTGCC 
AACCAGCGGG AGGAGGGCCA GGGTGAACAG GGCGAGGAGG AGGATGAGGA GGAGGAAGAA 
GAGGAGGACG TGGCTGAGAA CCGCTACTTT GAAATGGGGC CCCCAGACGT GGAGGAGGAG 
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GAGGGAGGAG GCC^^GGGA GGAAGAGGAG GAGGAAGAGG ARGATGAAGA RGCCGAGGAG 3 00 

GAGCGGCTGG CTCTGGAATG GGGCCTGGGC GCGGACGAAG ACTTCCTGCT GGAGC AC AT C 3 60 

CGCATCCTCA AGGTGCTGTG GTGCTTCCTG ATCCATGTGC AGGGCAGTAT CCGCCAGTTC 42 0 

GCCGCCTGCC TTGTGCTCAC CGACTTCGGC ATCGCAGTCT TCGAGATCCC GCACCAGGAG 480 

TCTCGGGGCA GCAGCCAGCA CATCCTCTCC TCCCTGCGCT TTGTCTTTTG CTTCCCGCAT 540 

GGCGACCTCA CCGAGTTTGG CTTCCTCATG CCGGAGCTGT GTCTGGTGCT CAAGGTACGG 60 0 

CACAGTGAGA ACACGCTCTT CATTATCTCG GACGCCGCCA ACCTGCACGA GTTCCACGCG 66 0 

GACCTGCGCT CATGCTTTGC ACCCCAGCAC ATGGCCATGC TGTGTAGCCC CATCCTCTAC 72 0 

GGCAGCCACA CCAGCCTGCA GGAGTTCCTG CGCCAGCTGC TCACCTTCTA CAAGGTGGCT 780 

GGCGGCTGCC AGGAGCGCAG CCAGGGCTGC TTCCCCGTCT ACCTGGTCTA CAGTGACAAG 84 0 

CGCATGGTGC AGACGGCCGC CGGGGACTAC TCAGGCAACA TCGAGTGGGC CAGCTGCACA 900 

CTCTGTTCAG CCGTGCGGCG CTCCTGCTGC GCGCCCTCTG AGGCCGTCAA GTCCGCCGCC 960 

ATCCCCTACT GGCTGTTGCT CACGCCCCAG CACCTCAACG TCATCAAGGC CGACTTCAAC 102 0 

CCCATGCCCA ACCGTGGCAC CCACAACTGT CGCAACCGCA AC AG CTTC AA GCTCAGCCGT 1080 

GTGCCGCTCT CCACCGTGCT GCTGGACCCC ACACGCAGCT GTACCCAGCC TCGGGGCGCC 114 0 

TTTGCTGATG GCCACGTGCT AGAGCTGCTC GTGGGGTACC GCTTTGTCAC TGCCATCTTC 12 00 

GTGCTGCCCC ACGAGAAGTT CCACTTCCTG CGCGTCTACA ACCAGCTGCG GGCCTCGCTG 12 6 0 

CAGGACCTGA AGACTGTGGT CATCGCCAAG ACCCCCGGGA CGGGAGGCAG CCCCCAGGGC 13 2 0 

TCCTTTGCGG ATGGCCAGCC TGCCGAGCGC AGGGCCAGCA ATGACCAGCG TCCCCAGGAG 13 8 0 

GTCCCAGCAG AGGCTCTGGC CCCGGCCCCA GTGGAAGTCC CAGCTCCAGC CCCTGCAGCA 144 0 

GCCTCAGCCT CAGGCCCAGC GAAGACTCCG GCCCCAGCAG AGGCCTCAAC TTCAGCTTTG 1500 

GTCCCAGAGG AGACGCCAGT GGAAGCTCCA GCCCCACCCC CAGCCGAGGC CCCTGCCCAG 156 0 

TACCCGAGTG AGCACCTCAT CCAGGCCACC TCGGAGGAGA ATCAGATCCC CTCGCACTTG 162 0 

CCTGCCTGCC CGTCGCTCCG GCACGTCGCC AGCCTGCGGG GCAGCGCCAT CATCGAGCTC 16 8 0 

TTCCACAGCA GCATTGCTGA GGTTGAAAAC GAGGAGCTGA GGCACCTCAT GTGGTCCTCG 1740 

GTGGTGTTCT ACCAGACCCC AGGGCTGGAG GTGACTGCCT GCGTGCTGCT CTCCACCAAG 180 0 

GCTGTGTACT TTGTGCTCCA CGACGGCCTC CGCCGCTACT TCTCAGAGCC ACTGCAGGAT 18 6 0 

TTCTGGMATC AGAAAAACAC SGACTACAAC AACAGCCCTT TCCACATCTC CCAGTGCTTC 192 0 
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GTGCTAAAGC TTAGTGACCT^IaGTCAGTC AATGTGGGGC TTTTCGACCA gWtTTCCGG 19 80 

CTGACGGGTT CCACCCCGAT GCAGGTGGTM ACGTGCTTGA CGCGGGACAG CTACCTGACG 2040 

CACTGCTTCC TCCAGCACCT CATGGTCGTG CTGTCCTCTC TGGAACGCAC GCCCTCGCCG 2100 

GAGCCTGTTG ACAAGGACTT CTACTCCGAG TTTGGGAACA AGACCACAGG GAAGATGGAG 2160 

AACTACGAGC TGATCCACTC TAGTCGCGTC AAGTTTACCT ACCCCAGTGA GGAGGAGATT 222 0 

GGGGACCTGA CGTTCACTGT GGCCCAAAAG ATGGCTGAGC CAGAGAAGGC CCCAGCCCTC 22 8 0 

AGCATCCTGC TGTACGTGCA GGCCTTCCAG GTGGGCATGC CACCCCCTGG GTGCTGCAGG 23 40 

GGCCCCCTGC GCCCCAAGAC ACTCCTGCTC ACCAGCTCCG AGATCTTCCT CCTGGATGAG 240 0 

GACTGTGTCC ACTACCCACT GCCCGAGTTT GCCAAAGAGC CGCCGCAGAG AGACAGGTAC 2460 

CGGCTGGACG ATGGCCGCCG CGTCCGGGAC CTGGACCGAG TGCTCATGGG CTACCAGACC 252 0 

^^^^rv- rrrwiprri' rrSTCTTCGAT GACGTGCAAG GTCATGACCT CATGGGCAGT 2580 

inv-v-^owno^ ^- ~ 

GTCACCCTGG ACCACTTTGG GGAGGTGCCA GGTGGCCCGG CTAGAGCCAG CCAGGGCCGT 2640 

GAAGTCCAGT GGCAGGTGTT TGTCCCCAGT GCTGAGAGCA GAGAGAAGCT CATCTCGCTG 270 0 

TTGGCTCGCC AGTGGGAGGC CCTGTGTGGC CGTGAGCTGC CTGTCGAGCT CACCGGCTAG 27 60 

CCCAGGCCAC AGCCAGCCTG TCGTGTCCAG CCTGACGCCT ACTGGGGCAG GGCAGCAGGC 282 0 

TTTTGTGTTC TCTAAAAATG TTTTATCCTC CCTTTGGTAC CTTAATTTGA CTGTCCTCGC 2 88 0 

AGAGAATGTG AACATGTGTG TGTGTTGTGT TAATTCTTTC TCATGTTGGG AGTGAGAATG 2940 

CCGGGCCCCT CAGGGCTGTC GGTGTGCTGT CAGCCTCCCA CAGGTGGTAC AGCCGTGCAC 3 000 
ACCAGTGTCG TGTCTGCTGT TGTGGGACCG TTGTTAACAC GTGACACTGT GGGTCTGACT 3 06 0 
TTCTCTTCTA CACGTCCTTT CCTGAAGTGT CGAGTCCAGT CCTTTGTTGC TGTTGCTGTT 312 0 
GCTGTTGCTG TTGCTGTTGG CATCTTGCTG CTAATCCTGA GGCTGGTAGC AGAATGCACA 3180 
TTGGAAGCTC CCACCCCATA TTGTTCTTCA AAGTGGAGGT CTCCCCTGAT CCAGACAAGT 3240 
GGGAGAGCCC GTGGGGGCAG GGGACCTGGA GCTGCCAGCA CCAAGCGTGA TTCC TGCTGC 33 00 
CTGTATTCTC TATTCCAATA AAGCAGAGTT TGACACCGTC AAAAAAAAAA AAAAAAAAAA 33 60 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 3395 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gly Pro Pro Asp Val Glu Glu Glu Glu Gly Gly Gly Gin Gly Glu 
15 10 15 

Glu Glu Glu Glu Glu Glu Glu Asp Glu Glu Ala Glu Glu Glu Arg Leu 
20 25 30 

Ala Leu Glu Trp Ala Leu Gly Ala Asp Glu Asp Phe Leu Leu Glu His 
35 40 45 

lie Arg lie Leu Lys Val Leu Trp Cys Phe Leu lie His Val Gin Gly 
50 55 60 

Ser lie Arg Gin Phe Ala Ala Cys Leu Val Leu Thr Asp Phe Gly lie 
65 70 75 80 

Ala Val Phe Glu lie Pro His Gin Glu Ser Arg Gly Ser Ser Gin His 
85 90 95 

lie Leu Ser Ser Leu Arg Phe Val Phe Cys Phe Pro His Gly Asp Leu 
100 105 110 

Thr Glu Phe Gly Phe Leu Met Pro Glu Leu Cys Leu Val Leu Lys Val 
115 120 125 

Arg His Ser Glu Asn Thr Leu Phe lie lie Ser Asp Ala Ala Asn Leu 
130 135 140 

His Glu Phe His Ala Asp Leu Arg Ser Cys Phe Ala Pro Gin His Met 
145 150 155 160 

Ala Met Leu Cys Ser Pro lie Leu Tyr Gly Ser His Thr Ser Leu Gin 
165 ' 170 175 

Glu Phe Leu Arg Gin Leu Leu Thr Phe Tyr Lys Val Ala Gly Gly Cys 
180 185 190 

Gin Glu Arg Ser Gin Gly Cys Phe Pro Val Tyr Leu Val Tyr Ser Asp 
195 200 205 

Lys Arg Met Val Gin Thr Ala Ala Gly Asp Tyr Ser Gly Asn lie Glu 
210 215 220 

Trp Ala Ser Cys Thr Leu Cys Ser Ala Val Arg Arg Ser Cys Cys Ala 
225 230 235 240 
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Pro Ser Glu Ala Lys Ser Ala Ala He Pro Tyr T^^Leu Leu Leu 

245 250 255 

Thr Pro Gin His Leu Asn Val He Lys Ala Asp Phe Asn Pro Met Pro 
260 265 270 

Asn Arg Gly Thr His Asn Cys Arg Asn Arg Asn Ser Phe Lys Leu Ser 
275 280 285 

Arg Val Pro Leu Ser Thr Val Leu Leu Asp Pro Thr Arg Ser Cys Thr 
290 295 300 

Gin Pro Arg Gly Ala Phe Ala Asp Gly His Val Leu Glu Leu Leu Val 

305 310 315 320 

Gly Tyr Arg Phe Val Thr Ala He Phe Val Leu Pro His Glu Lys Phe 
325 330 335 

His Phe Leu Arg Val Tyr Asn Gin Leu Arg Ala Ser Leu Gin Asp Leu 
340 345 350 

Lys Thr Val Val He Ala Lys Thr Pro Gly Thr Gly Gly Ser Pro Gin 
355 360 365 

Gly Ser Phe Ala Asp Gly Gin Pro Ala Glu Arg Arg Ala Ser Asn Asp 
370 375 380 

Gin Arg Pro Gin Glu Val Pro Ala Glu Ala Leu Ala Pro Ala Pro Val 
385 390 395 400 

Glu Val Pro Ala Pro Ala Pro Ala Ala Ala Ser Ala Ser Gly Pro Ala 
405 410 415 

Lys Thr Pro Ala Pro Ala Glu Ala Ser Thr Ser Ala Leu Val Pro Glu 
420 425 430 

Glu Thr Pro Val Glu Ala Pro Ala Pro Pro Pro Ala Glu Ala Pro Ala 
435 440 445 

Gin Tyr Pro Ser Glu His Leu He Gin Ala Thr Ser Glu Glu Asn Gin 
450 455 460 

He Pro Ser His Leu Pro Ala Cys Pro Ser Leu Arg His Val Ala Ser 
465 470 475 480 

Leu Arg Gly Ser Ala He He Glu Leu Phe His Ser Ser He Ala Glu 
485 490 495 

Val Glu Asn Glu Glu Leu Arg His Leu Met Trp Ser Ser Val Val Phe 
500 505 510 

Tyr Gin Thr Pro Gly Leu Glu Val Thr Ala Cys Val Leu Leu Ser Thr 
515 520 525 



Lys 



Ala Val Tyr Phe Val Leu His Asp Gly Leu Arg Arg Tyr Phe Ser 
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Glu Pro Leu Gin Asp Phe Trp Xaa Gin Lys Asn Thr Asp Tyr Asn Asn 
545 550 555 560 

Ser Pro Phe His lie Ser Gin Cys Phe Val Leu Lys Leu Ser Asp Leu 
565 570 575 

Gin Ser Val Asn Val Gly Leu Phe Asp Gin His Phe Arg Leu Thr Gly 
580 585 590 

Ser Thr Pro Met Gin Val Val Thr Cys Leu Thr Arg Asp Ser Tyr Leu 
595 600 605 

Thr His Cys Phe Leu Gin His Leu Met Val Val Leu Ser Ser Leu Glu 
610 615 620 

Arg Thr Pro Ser Pro Glu Pro Val Asp Lys Asp Phe Tyr Ser Glu Phe 
625 630 635 640 

Gly Asn Lys Thr Thr Gly Lys Met Glu Asn Tyr Glu Leu lie His Ser 
645 650 655 

Ser Arg Val Lys Phe Thr Tyr Pro Ser Glu Glu Glu lie Gly Asp Leu 
660 665 670 

Thr Phe Thr Val Ala Gin Lys Met Ala Glu Pro Glu Lys Ala Pro Ala 
675 680 685 

Leu Ser lie Leu Leu Tyr Val Gin Ala Phe Gin Val Gly Met Pro Pro 
690 695 700 

Pro Gly Cys Cys Arg Gly Pro Leu Arg Pro Lys Thr Leu Leu Leu Thr 
705 710 715 720 

Ser Ser Glu lie Phe Leu Leu Asp Glu Asp Cys Val His Tyr Pro Leu 
725 730 735 

Pro Glu Phe Ala Lys Glu Pro Pro Gin Arg Asp Arg Tyr Arg Leu Asp 
740 745 750 

Asp Gly Arg Arg Val Arg Asp Leu Asp Arg Val Leu Met Gly Tyr Gin 
755 760 765 

Thr Tyr Pro Gin Ala Leu Thr Leu Val Phe Asp Asp Val Gin Gly His 
770 775 780 

Asp Leu Met Gly Ser Val Thr Leu Asp His Phe Gly Glu Val Pro Gly 
785 790 795 800 

Gly Pro Ala Arg Ala Ser Gin Gly Arg Glu Val Gin Trp Gin Val Phe 
805 810 815 

Val Pro Ser Ala Glu Ser Arg Glu Lys Leu lie Ser Leu Leu Ala Arg 
820 825 830 
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Gin Trp Glu Ala blT Cys Gly Arg Glu Leu Pro Val Gl^Leu Thr Gly 

835 840 845 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 1147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



60 
120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGAAGGAGTT CTGGAATTGG AAAACCATCA TTTTTCAACC ATCACAGTAA ATATGGCTCA 
GGCAAGAATT ATCAATCAAT GCTAAAGCTA GGGGGAAATT TCGCTTAGGA GCAGGATATT 
AGGGTATTAG TCTGGGCTTA AAGTATCTCC TCACAGATTG TTGTTAGTTT CTGGGGAAAG 180 
AATAGTAACC ATGCAATGGA AAAAAATGGA CAACCTCTTG ACTAGGTTAT CAAAATTAAC 240 
CTCACCAATA AAGGGTGGAT GTTCAACATG TGCCTTCAAA TGTGACCCAC TGAGAAGGAA 
ACAACATCAC TGTAACAACA ACAACCAGAA ACGACAGGGG GTTTTGACTG AATTCTTCAA 
AAATGTCAAT GTCATAGAAG ACAAAGAAAG GTTGTGGAAA TGTTTCAGAT TAAATGATAG 
T AAAAAC AC C TGACAACTAA ACATAGTAAG T AAT AC TAG A CTGGATTCTG TACCAGAGGT 
AACATAAGTG CTCCAAAGGA CAATGTTAGG TCAACTGGCA AATTGGAATA TAGACAGTCA 
ATCAGATAAG AAGTATACTT TGATTAAGTA AAAAAAATCC CTATTCTTGG AAAATACACA 6 00 

ATAAAGTATT TTGAGGTAAA GGGC CAT AAT GTATGCAATC TACTCTCAAA AAATTCAGAA 
ACATATATTT GTGTGCATTT GCATGTGCAA CAGTACACAC AAACATACAT AAAGAGAGCA 
ATTGATAAGG CAAATAAGGT AACATTTAAC AATAATCTGA TACACATAAA TAGAGAAAGA 
GCAATTGATA AAGTAAATGA GGTAAAATTT AACAATAATC TGAGCAAAAG GTATATGTGT 
TTTCTTTGAG ACAGTCTGAT TCTTGCAACT TATTCTGTAA GTTGGAACTT ATTTCCAAAC 
ATGATTGAAA AAAAACCCCG CACTTGGCAA CTTCTTCTCT TTTTCAGCCT AGAAATGTCT 
GTGTTAAGTG GTTTTTTATT TATTGTTGTT GTTTGTTGTT ATTGTTGTTT TGTTGCCAGG 
CTCCAACTCA CAAAATACGA GTTTAAAAAC TGCGTTGTTA TTTTTAGAGA TTTGTGATAA 10 8 0 
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TACAACTTGT TATAjffivrTT ATTCCTCAAT AAATATAATT TCTCT^TAT GCAAAAAAAA 114 0 



AAAAAAA 

(2) INFORMATION FOR SEQ ID NO : 1 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met lie Glu Lys Lys Pro Arg Thr Trp Gin Leu Leu Leu Phe Phe Ser 
15 10 15 

Leu Glu Met Ser Val Leu Ser Gly Phe Leu Phe lie Val Val Val Cys 
20 25 30 

Cys Tyr Cys Cys Phe Val Ala Arg Leu Gin Leu Thr Lys Tyr Glu Phe 
35 40 45 

Lys Asn Cys Val Val He Phe Arg Asp Leu 
50 55 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

C C TTTT T AAA AAATATCTGA AAAAAGCTTC ATATCTTTAC AAACTCATAA AATAGCTGAT 60 

TGGGCCATGG AGGAGATGAG GC TGTTTAG A ACTGGTTTTG TTTCAAGTTT GTCAATTTTC 120 

CCTGTATGAG AACTTGGGTA AAGCACAAAG AAACATACAG TGCTAGTAAC AGGTCTCCTG 180 

CGCCCTGGAA CTAAGTGTTT GGAGGAAGGA CTAAACCCCG GGGGAGGTGA GTATAAAATA 2 40 

ATTCCACTAA GATCACCTCC TCAGTCCCCA GAAGGCTGAT GGTGGATCCT CTGGCCATCT 3 00 
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CCTGTGGGGT CTTACTGCTcTIcTGCCATT TCTCTATGCC TGAAGACACG JHGATGATAT 3 60 
CAAGGCAGAG CTACCATATC GCAGCCAGTC TCTAGGCTAC TGCTGTGCAG TGGCTCCCAC 42 0 
TTTCTAATGC TTTTTTGTTT TTGCTTTTTC TAACAAAACA ATCTTTTTTC AAAATGAATT 480 
CCAACCCCTG CTAGTTCCTT CGCTGCCTCC ATACTGTTTT AGGCAGCACC GTTTATGTGA 540 
CAGAGTCCGT GTTTCTCAAA TGCATGGTGT TCCTCAGGTG GAGAGTGGGC AGAAGTTTTT 600 
GCAACACTTT TTTTTTAAGT TATTGGGTGC AAAATCCCAA ACCAGGATAT GTGTATGTCT 660 
GTGTGTTTAT GTTTTTTATT TGACCCTCCC CTCTTTCAAC CTACCCCCTT TTATATCTAA 720 
TGTAGAAAAA GCGAAATTGA ATCTGGAAAG CAAACTGTTG TATATAGTTG CGGTAACAAT 7 80 
CATGAAGAGA GAGCCGGGCT GTCCCCTCAG TAATTCATTT TAAATAACAA ATTATTTAAA 840 
AATAAAATTC ATGCCAGAGC CAGCTGAAGA GGCCTTCCTT CATCACCACT GAGGCCACCC 900 
„„„,^av-nv^ Arrv-TnnrATG TPTCCTCCCA GCAAGATTCA TCTGTTCAAT 960 

L-^jHkJ-Vl V_ 1 UUO XV- -lvJ-lw^ - - 

GCCATTTGCG TTTCAATAAA GTTATCTCCT GTACTGTCAA AAAAAAAAAA AAA 1013 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met His Gly Val Pro Gin Val Glu Ser Gly Gin Lys Phe Leu Gin His 
15 io 15 

Phe Phe Phe Lys Leu Leu Gly Ala Lys Ser Gin Thr Arg He Cys Val 
20 25 30 

Cys Leu Cys Val Tyr Val Phe Tyr Leu Thr Leu Pro Ser Phe Asn Leu 
35 40 45 

Pro Pro Phe He Ser Asn Val Glu Lys Ala Lys Leu Asn Leu Glu Ser 
50 55 60 



Lys Leu Leu Tyr He Val Ala Val Thr He Met Lys Arg Glu Pro Gly 
6 5 70 75 80 

Cys Pro Leu Ser Asn Ser Phe 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1763 base pairs 

( B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCGGGATAAA AAGCAAGAAA AGAAAGAGAA GACTGAGAAT AAGAAGATCT CTTTGAAAAT 
AAAATAAGAC TGCTAAAAGT ATTTGGTATA CAGTCTGGAA AATAAAGTTG AGGGAATCTC 
TCCAGATAAA GAGCAAAAAG AAATAGATAG AAAAATATAA AGAAAGAAAA AAGACATAGA 
CAATCAATAT GTAATGTTAG GAGTTCCTGG AAGAGAGAAC AGAGACAGTG TAGGTGAAGA 
AATAAAAAGA AAAAGAATTG AAGAACAGAG CAAGCTAAGT CTCCAGATTG AGAGGGCCCA 
ATACAATCTA CATCTAGACA CAATATTGTA AAATTTTGGA ATATTAAGGA TAGAAGGAAG 
ATATTAAAGT GGCCAGGGAG AAAACAAATG AGGTCATCAC GATTAGCTCA ACACAAAAAT 
GGATGAGAAA TAGACTGCTA ACAGATTTGT CATCAGCAAC ACTGAATGCC AGAAGTCAAT 
GGATCAACAT CTTCAGAGCT TAAGGAAAAT TTTTGTACCT AGAATTTCAT AGTAAGGCAG 
ACTGTCAAGA AGAACATCAA AGTGAAGACA TTTTCTGTCA GGCAAATTTT CAGAAAGTCT 
CCTTTGCACC CTTACTGAGG AAGTATCTTG AGGAAATTCT CCAGCAAAAT GAGGATGAAA 
ACCAGGAAAG AAGAAGAAAT GGGATCCATA AAACAGTGGA CCTTACTTAG GATGTCTCAT 
TCTAGAGTGA CAGCCAAAAG GGTATCTCAC CCTAGAGTGA CAGCTATCCA GCAGACTAAT 
TTCAGATGAG AGCATACTGT CTCGGGCTTT CTGGGAAGAA TGTGCATTCA GTGCCATAGA 
TAGTATCACT GAAGAGCTGG GATGCTTGAG AAGATTATTT AGTCAAGAAA AAAGAAAGAC 
AAATCAACAA TATGTCAAAA AATTCAGGTC CAATTATAGA GCAAAATAAA ATGAGGCATG 
ATTTTGAGTT ATTCATGAAG AATAAGAAGA GGCTTGATAG GTACATTTCC TTTTCTATGG 
CACAGGCATG ATGATATTGG GTGTGTAGGG AAGAAAATAT CCTAGCTTAT ACTAGGCTCC 
CAGTAAGAAG TATTTAAATA GCCAAAATAA TGTGGATATC ATTTATTAGT ATTCAATGTT 



60 
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180 
240 
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86 



BNSDOCID: <WO 9832853A2 I > 



W^r 7o/J*u3«J ^^^^ 




I 


PCT/US98/01396 


CAGATCAGCC TATTAACAAA GTGTGAAAGG 


TTTPATTTTT 

1 1 ± \„ r\ 1 J. 1 1 X 


TATTCAGAAC 


'BvAGTTGAA 


1200 


AGTAATTAAT GCTGACAAAG G GAAAbAAAb 


P AH A A ACJAC4A 


TTGAGAATTA 


GAGGAAGAGA 


1260 


AGTGGAATC A AAGGTAGAGA TAC 1 1 A 1 A 1 A 




GGATGAAAAG 


ATCTTCAGTT 


1320 


AATGGAACAA GAACTAGAGG ATTAGIG1A1 


loll \„flruivj 


TATAAAATCA 


AACCAATAGA 


1380 


TGTATTAAAA AGTGATGTAA CIAlLAbALA 


TTTY^HIAQACIA 


GATGGACAAA 


GGAAAGTGGC 


1440 


GATAGTGTAA GTTAAATCCT TAILiil ibi 


A ATflfiClGA AT 


TATTAAAGAT 


GTTGTAAAGT 




CAGTAAGTCA AGAAATTATT GCTCAAACA1 


ATTATTTA A A 
rt. x l /A xxx xvrvr^ 


GTTAGAAAGT 


TACCAGACGA 


1560 


TCTAAAATAA ATATTGTTAA AAGCATTACC 


I. V X rVJV-3vJ/i-t\ X 


GGGATTTAGA 


TTTAAAAAGG 


1620 


GTGGGATGGG AAACTGTGTT TTTCATTTTA 


AGTCCTTCTG 


TACTATTTAA 


TTTTTTACCT 


1680 


TGTGC ATG 1 A 11 AL 1 1 1 £\r\r^f\ x x x x x i~i 


ATAAACCCAA AT AAAAATC T AAAAAAAAAA 


1 / 'lU 










1763 


(2) INFORMATION FOR SEQ ID NO: 22: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Arg Met Lys Thr Arg Lys Glu Glu Glu Met Gly Ser He Lys Gin 
1 5 10 15 

Trp Thr Leu Leu Arg Met Ser His Ser Arg Val Thr Ala Lys Arg Val 
20 25 • 30 

Ser His Pro Arg Val Thr Ala He Gin Gin Thr Asn Phe Arg 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) 



:CRIPTION: /desc = 



oligonucleo 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



CNATCTTAGAG CTCAAAGTTA GGGTCTG 



28 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CNCAGAGCTGT TCTGATACTA AGTCTCAC 29 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ANACTATCTTC TTCAGAGGCC AGATCACC 2 9 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CNAGAAGCCAG CTGGCTTTGA ATTTCCTC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CNTTTTCCAAT ATGCTTCAAT GGCTCCGT 
(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 
TNGGTAGAAGG AGAGCAGGAA GGCCATGA 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 



GNCTTCTCTGG CTCAGCCATC TTTTGGGC 



29 



(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
CNGTACACACA AACATACATA AAGAGAGC 29 
(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
ANACGGACTCT GTCACATAAA CGGTGCTG 2 9 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

29 



GNTG AG AT AC C CTTTTGGCTG TCACTCTA 



[2) INFORMATION FOR SEQ ID NO : 3 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asn Asn Leu Leu Thr Arg Leu Ser Lys Leu 
10 15 

Thr Ser Pro He Lys Gly Gly Cys Ser Thr Cys Ala Phe Lys Cys Asp 
20 25 30 

Pro Leu Arg Arg Lys Gin His His Cys Asn Asn Asn Asn Gin Lys Arg 
35 40 45 

Gin Gly Val Leu Thr Glu Phe Phe Lys Asn Val Asn Val He Glu Asp 
50 " 60 

Lys Glu Arg Leu Trp Lys Cys Phe Arg Leu Asn Asp Ser Lys Asn Thr 
65 70 75 80 
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What is claimed isr 

1. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 506 to nucleotide 643; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 471 to nucleotide 765; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone AA35_2 deposited under accession 
number ATCC 98303; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AA35_2 deposited under accession number ATCC 98303; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AA35_2 deposited under accession number 
ATCC 98303; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AA35_2 deposited under accession number ATCC 98303; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

2. A composition of claim 1 wherein said polynucleotide is operably linked 
to at least one expression control sequence. 

3. A host cell transformed with a composition of claim 2. 
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claim 3, wherein said cell is a mami^r 



4. The host ceWf claim 3, wherein said cell is a mammaTian cell. 

5. A process for producing a protein encoded by a composition of claim 2, 

which process comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 

medium; and 

(b) purifying said protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

q A mmnnsition comprising; a protein, wherein said protein comprises an 

^" - ' ■ r * ^ 

amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 1 to 
amino acid 32; 

(c) fragments of the amino acid sequence of SEQ ID NO:2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
AA35_2 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

9. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2. 

10. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2 from amino acid 1 to amino acid 32. 

11. The composition of claim 8, further comprising a pharmaceutically 
acceptable carrier. 

12. A method for preventing, treating or ameliorating a medical condition 
which comprises administering to a mammalian subject a therapeutically effective amount 
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13. An^olated gene corresponding to the cDN^^equence of SEQ ID NO:l. 



14. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 71 to nucleotide 736; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 113 to nucleotide 736; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 1 to nucleotide 343; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone AM42_3 deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AM42_3 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AM42_3 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AM42_3 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 
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15. A compos^S comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 to 
amino acid 91; 

.(c) fragments of the amino acid sequence of SEQ ID NO:4; and 
(d) the amino acid sequence encoded by the cDNA insert of clone 
AM42_3 deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. 

16. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3. 

it a „ — nnoiHnn rnmnrisinc an isolated polynucleotide selected from the 

group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 55 to nucleotide 423; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BG137_7 deposited under accession 

number ATCC 98303; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BG137_7 deposited under accession number ATCC 98303; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BG137_7 deposited under accession number 
ATCC 98303; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BG137_7 deposited under accession number ATCC 98303; 

(g) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:6; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 



/A oUavh' 
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(j) a polynucleotide which encodes a species horn ologue of the protein 
of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

18. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) the amino acid sequence of SEQ ID NO:6 from amino acid 62 to 
amino acid 123; 

(c) fragments of the amino acid sequence of SEQ ID NO:6; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BG137_7 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

19. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:5. 

20. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 186 to nucleotide 2030; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 873 to nucleotide 2030; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 802 to nucleotide 1173; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CH699_1 deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CH699_1 deposited under accession number ATCC 98303; 
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(g) a p^^Tucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CH699_1 deposited under accession number 




ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CH699_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:8; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

21. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) the amino acid sequence of SEQ ID NO:8 from amino acid 218 to 
amino acid 329; 

(c) fragments of the amino acid sequence of SEQ ID NO:8; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CH699_1 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

22. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:7. 

23. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 



(a) 



a polynucleotide comprising the nucleotide sequence of SEQ ID 



NO:10; 



(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:10 from nucleotide 111 to nucleotide 677; 
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(c) 



a polynucleotide comprising the nucleotide sequence of SEQ ID 



NO:10 from nucleotide 156 to nucleotide 677; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone C0851_l deposited under accession 
number ATCC 98303; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone C0851_l deposited under accession number ATCC 98303; 



ATCC 98303; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone C0851_l deposited under accession number ATCC 98303; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:ll; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:ll having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

24. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:ll; 

(b) the amino acid sequence of SEQ ID NO:ll from amino acid 120 to 
amino acid 189; 



(d) the amino acid sequence encoded by the cDNA insert of clone 
C0851_l deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. 



(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone C0851_l deposited under accession number 



fragments of the amino acid sequence of SEQ ID NO: 11; and 
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25. An isolated^ne corresponding to the cDNA sequenc^of SEQ ID NO:10, 
SEQ ID NO:9 or SEQ ID NO:12 . 



26. A composition comprising an isolated polynucleotide selected from the 

group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 123 to nucleotide 755; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 13 from nucleotide 279 to nucleotide 755; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 1 to nucleotide 631; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CP111_1 deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CP111_1 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CP111_1 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CP111_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:14; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 14 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 
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27. A composition comprising a protein, wherem said protein comprises an 
amino acid sequence selected from the group consisting of; 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) the amino acid sequence of SEQ ID NO: 14 from amino acid 1 to 
amino acid 171; 

. (c) fragments of the amino acid sequence of SEQ ID NO:14; and 
(d) the amino acid sequence encoded by the cDNA insert of clone 
CP111_1 deposited under accession number ATCC 98303; 
the protein being substantially free from other mammalian proteins. 

28. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:13. 

29. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:15; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 214 to nucleotide 2760; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 406 to nucleotide 2760; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 15 from nucleotide 2011 to nucleotide 2565; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CS278„1 deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CS278_1 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CS278„1 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CS278_1 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 
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(j) a p^^ucleotide encoding a protein compris^^a fragment of the 
amino acid sequence of SEQ ID NO:16 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

30. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO: 16; 

(b) the amino acid sequence of SEQ ID NO:16 from amino acid 596 to 
amino acid 784; 

(c) fragments of the amino acid sequence of SEQ ID NO: 16; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CS278_1 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

31. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:15. 

32. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO: 17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 901 to nucleotide 1074; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 970 to nucleotide 1074; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 626 to nucleotide 1147; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone DF968_3 deposited under accession 
number ATCC 98303; 
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(f) ^^ a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone DF968J3 deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone DF968_3 deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone DF968_3 deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 18; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

33. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO: 18; 

(b) fragments of the amino acid sequence of SEQ ID NO: 18; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
DF968_3 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

34. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:17. 

35. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:19; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:19 from nucleotide 560 to nucleotide 820; 
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(c) a pl^ucleotide comprising the nucleotide^Rfuence of the full- 
length protein coding sequence of clone DN1120_2 deposited under accession 
number ATCC 98303; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone DN1120_2 deposited under accession number ATCC 98303; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone DN1120_2 deposited under accession number 
ATCC 98303; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone DN1120_2 deposited under accession number ATCC 98303; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:20; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:20 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 

of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

36. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:20; 

(b) the amino acid sequence of SEQ ID NO:20 from amino acid 1 to 
amino acid 61; 

(c) fragments of the amino acid sequence of SEQ ID NO:20; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
DN1120_2 deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

37. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:19. 
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nposition comprising an isolated polynui 



38. ;<^omposition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:21; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 649 to nucleotide 786; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 736 to nucleotide 786; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:21 from nucleotide 525 to nucleotide 787; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone D0589_l deposited under accession 
number ATCC 98303; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone D0589_l deposited under accession number ATCC 98303; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone D0589_l deposited under accession number 
ATCC 98303; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone D0589_l deposited under accession number ATCC 98303; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:22; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:22 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

39. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:22; 
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\f^^its of the amino acid sequence of SE^^^ 



(b) fra J^nts of the amino acid sequence of SEC^D NO:22; and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
D0589_l deposited under accession number ATCC 98303; 

the protein being substantially free from other mammalian proteins. 

40. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:21. 
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FIGURE 1A 

Hindlll 1 




BamHI 2772 Clal 2666 



Plasmid name: pED6dpc2 * 
Plasmid size: 5374 bp 

Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
polylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et al.(1991), NAR 19: 4485-4490. 
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FIGURE IB 



Pvull 4439, 
Ndel 4315. 



Hindlll 1 



amHI 345 



Nhel 2691 




Ad VAI^^^* Nael 1488 

Clal 1821 



BamHI 1927 



Sapl 2135 



Plasmid name: pNOTs 
Plasmid size: 4529 bp 



Comments/References: pNOTs is a derivative of pMT2 (Kaufman at al,1989. Mol. Cell. Biol. 9: 174 1 -1750). 
QHFR was deleted and a new polylinker was inserted between EcoRI and Hpal. M13 origin 
of replication was inserted in the Clal site. SST cDNAs are cloned between EcoRI and 
Notl 
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148 Human Secreted Proteins 

Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
organelle, contains different proteins essential for the function of the organelle. The cell 
.._.:„„ ,;„„.,n- » „,hinh »rr amino acid motifs located within the protein, to target 

proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 
(ER). The ER separates the membrane-bounded proteins from all other types of 
proteins Once localized to the ER, both groups of proteins can be further directed to 
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to 
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other 
organelles. 

20 " Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggerine signal. In the latter case, the proteins are stored in secretory vesicles (or 
25 secretory granules) until exocytos.s is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage ot a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 
30 the commercially valuable human insulin, interferon, Factor VIII. human growth 
hormone, tissue plasminogen activator, and erythropoe.tm. Thus, in light of the 
pervasive role of secreted proteins in human phys.ology. a need exists for identifying 
and characterizing novel human secreted proteins and the genes that encode them. This 
t „™„.,HoP will niinw one to detect, to treat, and to prevent medical disorders by using 
35 secreted proteins or the genes that encode them. 
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Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
5 and recombinant methods for producing the polypeptides and polynucleotides. Also 
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 

10 Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 

15 environment (e.g., the natural environment if it is naturally occurring ), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

20 In the present invention, a "secreted" protein refers to those proteins capable of 

being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protein can undergo extracellular processing to produce 

25 a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence 

30 of the full length cDNA sequence, including the 5' and 3' untranslated sequences, the 
coding region, with or without the signal sequence, the secreted protein coding region, 
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated 
amino acid sequence generated from the polynucleotide as broadly defined. 

35 In the present invention, the full length sequence identified as SEQ ID NO:X 

was often generated by overlapping sequences contained in multiple clones (contig 



BNSDOCID: <WO 9922243A1 I > 



PCT7US98/22376 

WO 99/22243 



analyse). A representative clone containing all or most of the sequence for SEQ ID 
NO-X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 

5 Manassas, Virginia 20 1 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the international recognition of the deposit of 
microorganisms for purposes of patent procedure. 

A •'polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained 

10 in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited with 
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at 42° 

• • tn « e s<;r r750 mM NaCl. 75 mM sodium 

C in a solution comprising jvw — ~- ~ v 

citrate) 50 mM sodium phosphate (pH 7.6). 5x Denhardt's solution, 10% dextran 
sulfate, and 20 Ug/ml denatured, sheared salmon sperm DNA, followed by washing the 
15 filters in 0.1 x SSC at about 65°C. 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower percentages 
20 of formamide result in lowered stringency); salt conditions, or temperature. For 
example, lower stringency conditions include an overnight incubation at 3TC in a 
solution comprising 6X SSPE (20X SSPE = 3M NaCl: 0.2M NaH 2 PCy, 0.02M EDTA, 
pH 7.4). 0.5% SDS, 30% formamide. 100 ug/ml salmon sperm blocking DNA; 
followed by washes at 50°C with 1XSSPE. 0.1% SDS. In addition, to achieve even 
25 lower stringency, washes performed following stringent hybridization can be done at 
higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
30 Denhardt's reasent, BLOTTO, heparin, denatured salmon sperm DNA, and 

commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
35 as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
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complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 
5 The polynucleotide of the present invention can be composed of any 

polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA 
or modified RNA or DNA. For example, polynucleotides can be composed of single- 
and double-stranded DNA, DNA that is a mixture of single- and double-stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and 

10 double-stranded regions, hybrid molecules comprising DNA and RNA that may be 

single-stranded or, more typically, double-stranded or a mixture of single- and double- 
stranded regions. In addition, the polynucleotide can be composed of triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also 
contain one or more modified bases or DNA or RNA backbones modified for stability 

15 or for other reasons. "Modified" bases include, for example, tritylated bases and 

unusual bases such as inosine. A variety of modifications can be made to DNA and 
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically 
modified forms. 

The polypeptide of the present invention can be composed of amino acids joined 

20 to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and 
may contain amino acids other than the 20 gene-encoded amino acids. The 
polypeptides may be modified by either natural processes, such as posttranslational 
processing, or by chemical modification techniques which are well known in the art. 
Such modifications are well described in basic texts and in more detailed monographs, 

25 as well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 

30 branched , for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 

35 nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 

covalent attachment of phosphotidylinositof cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine. 
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formation of pyroglutamate. formylation, gamma-carboxylation. glycosylate, GPI 
anchor formation, hydroxylation. lodination. methylation. mynstoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation. racemization, 
selenoylation, sulfation, transfcr-RNA mediated addition of amino acids to proteins 
5 such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 

STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. 

H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990); 
0 Rattan et al., Ann NY Acad Sci 663:48-62 ( 1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO:Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
5 activity similar, but not necessarily identical to, an activity of a polypeptide of the 

present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it 
need not be identical to that of the polypeptide, but rather substantially similar to the 
dose-dependence in a given activity as compared to the polypeptide of the present 
>0 invention (i.e., the candidate polypeptide will exhibit greater activity or not more than 
about 25-fold less and, preferably, not more than about tenfold less activity, and most 
preferably, not more than about three-fold less activity relative to the polypeptide of the 
present invention.) 

25 Polynucleotide and Pol Y p-ptides of the Invention 



30 



FEATURES OF PROTEIN ENCODED BY GENE NO: 1 



In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

MRFISQQSCECVRPCMDVYVCVYISIHVYMDAHVYLCRICKTNMR (SEQ ID 
NO:309).RILRWVNCMACDLYLNKAVSVCAHVWMCMCVYISLYMYTWMP 
35 MCIYVEYVKQT (SEQ ID NO:3 10). NPENQLEI S FPPRRQ KMKLTLDLQ VSQS 
SLVHSLLSSDFFSVSKEGCLWKPILLPSHFL (SEQ ID NO:31 1). LQTQISN 
YLMFVLHTLHRYTWASMYTCIEIYTHTYTSIHGRTHSQLC (SEQ ID NO:312). 
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IHMGIHVYMYRDIYTHIHIHTWAHTLTALLRYKSHAIQLTHLNIR (SEQ ID 
NO:313), and/or MKWIFTVLILTSCFFTAGICEDGICSRIQL RDKIVQSAFRQ (SEQ 
ID NO:314). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

5 This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders, particularly neutropenia and related conditions. 

10 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. immune, hematopoeitic, and cancerous and 

15 wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 

and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e.. the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in neutrophils indicates that polynucleotides and 

20 polypeptides corresponding to this gene are useful for immune disorders. More 

specifically, this gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). Since the gene is expressed 
in cells of lymphoid origin, the natural gene product may be involved in immune 

25 functions. Therefore it may be also used as an agent for immunological disorders 
including arthritis, asthma, immunodeficiency diseases such as AIDS, leukemia, 
rheumatoid arthritis, granulomatous disease, inflammatory bowel disease, sepsis, acne, 
neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated 
cytotoxicity; immune reactions to transplanted organs and tissues, such as host-versus- 

30 graft and graft-versus-host diseases, or autoimmunity disorders, such as autoimmune 
infertility, lense tissue injury, demyelination, systemic lupus erythematosis, drug 
induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, scleroderma and 
tissues. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 

35 differentiation and/or proliferation of various cell types. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or immunotherapy 
targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO- 1 1 and may have been publicly available prior to conception ol the present 
mention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula ot 
a-b where a is any integer between 1 to 8 1 2 of SEQ ID NO: 1 1 , b is an integer of 15 to 
826. where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:l 1. and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

15 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: tto ^t 
KPCCPSVSNRSSVQMHQLPIQFLGQFEAHCIGFCRSFLETFYTHDPRAMHSFL 

SSISSPSLPFGFSRMTSQINHLHPSPLC (SEQ ID NO:315). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
->0 This sene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune d.sorders, particularly neutropenia. Similarly, polypeptides and 
25 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type. s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e "immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
30 lymph, serum, plasma, urine, synovial flu.d and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
35 NO: 161 as residues: Asp- 15 to Tyr-21. Pro-29 to Asn-39. 

The tissue distribution in neutrophils indicates that polynucleotides ana 
polypeptides corresponding to this gene are useful for immune disorders. Moreover. 
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the expression of this gene product indicates a role in the regulation of the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell Uncages, including 
blood stem cells. This gene product may be involved in the regulation of cytokine 
production, antigen presentation, or other processes that may also suggest a usefulness 
5 in the treatment of cancer (e.g. by boosting immune responses). Since the gene is 
expressed in cells of lymphoid origin, the natural gene product may be involved in 
immune functions. Therefore it may be also used as an agent for immunological 
disorders including arthritis, asthma, immunodeficiency diseases such as AIDS, 
leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel disease, 

10 sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell 
mediated cytotoxicity; immune reactions to transplanted organs and tissues, such as 
host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lense tissue injury, demyelination, systemic lupus 
erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, 

15 scleroderma and tissues. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

20 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 12 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

25 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 510 of SEQ ID NO: 12. b is an integer of 15 to 
524, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 12, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

In specific embodiments, polypeptides of the invention comprise the following 
35 amino acid sequence: SVFKINLKSFKQHEPWWPNRS (SEQ ID NO:316). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 



30 
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This °ene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnos.s of diseases and conditions wh.ch include, but are 
not limited to, immune disorders, including neutropenia. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number ot disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain Ussues or cell types 
(e - immune, hematopoietic, and cancerous and wounded ussues) or bodily fluids (e.g. 
lymph serum, plasma, urine, synov.al fluid and spinal fluid) or another tissue or cell 
«.mnle taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy t.ssue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO- 162 as residues: Met-1 to Arg-8, Leu-35 to Glu-41. 

The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for immune disorders. More 
specifically, expression of this gene product indicates a role in the regulation of the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell lineages, 
including blood stem cells. This gene product may also be involved in the regulation of 
cytokineVoduction, antigen presentation, or other processes that may also suggest a 
usefulness in the treatment of cancer (e.g. by boosting immune responses). Since the 
gene is expressed in cells of lymphoid origin, the natural gene product may be involved 
in immune functions. Therefore it may be also used as an agent for immunological 
disorders includ.ne arthritis, asthma, immunodeficiency d.seases such as AIDS, 
leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel disease, 
sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell 
mediated cytotoxicity; immune reactions to transplanted organs and tissues, such as 
host-versus-eraft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lense tissue injury, demyehnat.on, systemic lupus 
ervthematosis. dnig induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, 
scleroderma and tissues. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and in 
35 the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 



25 



30 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 1 3 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 477 of SEQ ID NO: 13, b is an integer of 15 to 
491, where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO: 13, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

15 This gene is expressed primarily in IL-1 and LPS induced neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune disorders, including ncutrophenia. Similarly, polypeptides and 

20 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 

25 lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 

30 NO: 163 as residues: Asn-45 to Thr-58. 

The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and/or prevention 
of a variety of immune disorders. In particular, this gene product may play a role in 
regulating the proliferation; survival; differentiation; and/or activation of hematopoietic 

35 cell lineages, including blood stem cells. Furthermore, this gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 



BNSDOCID: <WO 9922243A1 I > 



• PCTYUS98/22376 

1 1 

boosiine immune responses). Since the gene us expressed in cells of lymphoid origin, 
the natural sene product may be involved in immune functions. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, 
immunodeficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
5 granulomatous disease, inflammatory bowel disease, seps.s, acne, neutropenia, 

neutrophilia, psonas.s, hypersensitivities, such as T-cell mediated cytotoxicity; immune 
reactions to transplanted organs and tissues, such as host-versus-graft and gralt-versus- 
host diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury demyelinat.on, systemic lupus erythematous, drug induced hemolytic anemia, 
10 rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, th.s gene 
product may have commercial utility in the expansion of stem cells and committed 
nrooenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
15 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO- 14 and may have been publicly available prior to conception of the present 
invention Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
-><> Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b where a is any integer between 1 to 389 of SEQ ID NO: 14, b is an integer of 15 to 
403, where both a and b correspond to the positions of nucleotide residues shown ,n 
SEQ ID NO: 14, and where b is greater than or equal to a + 14. 



25 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: _. r _^^ I _ 
GTRSFSVPSYLRLTGSLMCYLLLLLIQTAELLIHPQGLQAVSNGESALKGTRPTF 

SSPFILVTEGRKEWEGVFLSSGWKGNTLSNYYISLVFYYSRILQPYFYCLWGK 
LEMVTLIRSVWRGINGGDKISVGFGKC (SEQ ID NO:317). WMERKHTVKLL 
YLLGFLLQNSPAIFLLSMGEVGDGDLD (SEQ ID NO:318) SNGESALKGTRP 
TFSSPFILVTE (SEQ ID NO:319), anchor LSNYYISLVFYYSRILQPYFYCLW (SEQ 
ID NO- V0) Polynucleotides encoding these polypeptides are also encompassed by the 
invention The 2 ene encoding the disclosed cDNA is believed to reside on chromosome 
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17. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 17. 

This gene is expressed primarily in the breast and brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune, reproductive, or neural disorders, such as cancers of the breast, 
lymph system and brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

10 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive, immune, and central nervous systems, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. immune, reproductive, neural, breast, and cancerous and wounded tissues) 
or bodily fluids (e.g. lymph, breast milk, serum, plasma, urine, synovial fluid and 

15 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 164 as residues: Leu-3 1 to Phe-38, Glu-47 to Trp-52. 

20 The tissue distribution in breast and brain tissue indicates that polynucleotides 

and polypeptides corresponding to this gene are useful for the diagnosis and treatment 
of cancers in the breast, lymph system, and brain. Moreover, the protein product of this 
gene may be useful for the detection/treatment of neurodegenerative disease states, 
behavioural disorders, or inflamatory conditions such as Alzheimers Disease, 

25 Parkinsons Disease, Huntingtons Disease, Tourette Syndrome, meningitis, 

encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 
hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 
disorder, panic disorder, learning disabilities, ALS, psychoses , autism, and altered 

30 bahaviors, including disorders in feeding, sleep patterns, balance, and preception. In 

addition, elevated expression of this gene product in regions of the brain indicates that it 
plays a role in normal neural function. Potentially, this gene product is involved in 
synapse formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Moreover, the gene or gene product may also play a role in 

35 the treatment and/or detection of developmental disorders associated with the 

developing embryo, sexually-linked disorders, or disorders of the cardiovascular 
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system. Protein, as well as, ant.bod.es d.rected aga.nst the prote.n may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO- 1 5 and may have been publicly available pr.or to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
According, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formu.a ot 
a-b where a is any integer between 1 to 799 of SEQ ID NO: 15, b is an integer of 15 to 
813, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 15. and where b is greater than or equal to a + 14. 



15 



20 



25 



FEATURES OF PROTEIN ENCODED BY GENE NO: 6 



This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the t.ssue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune d.sorders, such as neutropenia. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e g immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
Wmph serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e.. the expression level in healthy t.ssue or bodily fluid from an 
30 individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 

NO: 165 as residues: Ser-49 to Leu-54. 

The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis, treatment, and/or 
prevention of a variety of immune d.sorders. Moreover, the expression of this gene 
product indicates a role in regulating the proliferation: survival: differentiation: and/or 
activation of hematopoietic cell lineages, including blood stem cells. Th.s gene product 



35 
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may be involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 
the natural gene product may be involved in immune functions. Therefore it may be also 
5 used as an agent for immunological disorders including arthritis, asthma, 
immunodeficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
granulomatous disease, inflammatory bowel disease, sepsis, acne, neutropenia, 
neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated cytotoxicity; immune 
reactions to transplanted organs and tissues, such as host-versus-graft and graft-versus- 

10 host diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 

15 various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 16 and may have been publicly available prior to conception of the present 

20 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 250 of SEQ ID NO: 16. b is an integer of 15 to 

25 264, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 16, and where b is greater than or equal to a -h 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

30 

The translation product of this gene shar es sequence homology with neurotoxin 
which is thought to be important in neural diseases. In specific embodiments, 
polypeptides of the invention comprise the following amino acid sequence: 
EKDFMQGSDAGHGGTHIYRALVQWPLAWVFYLSHAKTHWGEELRFSFRRKN 
35 LRLREAMRHETCQVTQLVA GKADSNLCLRDSETWFWPPLWAACSSLQATA 

CRLSSPSKGLGASRECPWLASGRAALVSFL (SEQ ID NO:321), SLRVKGRKPR 
LLYHSPARGTLWMLPGLCDCL ICRQWLVERSRLPRVGARTRFQSP SDTGWS 
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25 



QLCQLPAV ,SEQ ID NO-.322). and/or ERSRLPRVGARTRFQSPSDTGWSQLC 
(SEQ ID NO:323). Polynucleotides encoding these polypeptides are also encompassed 

by the invention. 

This »enc is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are usetul as 
reacts for differential identification of the tissue(s) or cell ,ype(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
no, „m,,ed to. immune and neural diseases, particularly neurodegenerative disorder , 
uch a, Alzheimers or Parkinson's, Similarly, poly^ptides and ant.bod.es directed ,0 
these polypeptides are useful ,n providing immunolog.cal probes for differential 
ident, icln of the tissuefsi or eel, type(s). For a number of disorders of the a ove 

.: ,u ™„,c„,arlv of the immune and neural systems, expression of this gene 

7s g ific ntly'highcr or lower levels may be detected in certain tissues or cell types 
,e,g mmune. hemaopoietic. neural, and cancerous and wounded tissues or boddy 
fluids ,«.«. ,ymph, sen,m, plasma, ur.ne, synovia, fluid and spinal fluid) or anohe, 
tls ,ue Ir cell sample taken from an individual having such a disorder 
standard gene expression level, i.e., the expression level ,n healthy tissue or bc4,ly 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown m SEQ ID 
20 NO:166asresiducs:Gln-2toGly-10,Asp-77.oPhe-82. 

The tissue distribution ,n neutrophils combined with the homology to the 
conserved neurotoxin protein indicates that polynucleotides and polypeptides 
corresponds to this gene are useful for immune and neural diseases. Similarly, the 
protein product of this gene may be useful for the detection/treatment of 
neurodegenerative disease states, behavioural disorders, or inflatory condmons such 
as AlJLers Disease, Parkinsons Disease, Huntingtons Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms/hemorrhages. schizophrenia, mama, dementia, paranoia, obsessive 
compulsive disorder, panic disorder, learning disabilities, ALS, psychoses . autism, 
and altered hahaviors. including disorders in feeding, sleep patterns. 
prcccption. In addition, elevated expression of this gene product ,n regions of ,h b rain 
ndJtes tha, ,« plays a role ,n norma, neural function. Potentially, this gene product ,s 
involved ,n synapse formation, neurotransmission, learning, cognition, homeostas , o 
i,,,,™,,,! differentiation or survival.Moreover. the gene or gene product may also play a 
roll ,"n the treatment and/or detection of developmental disorders associated with tne 
develop™ embryo, sexually-linked disorders, or disorders of the cardiovascular 
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system. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
5 ID NO: 17 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
10 a-b, where a is any integer between 1 to 506 of SEQ ID NO: 17. b is an integer of 15 to 
520, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 17, and where b is greater than or equal to a + 14. 



15 FEATURES OF PROTEIN ENCODED BY GENE NO: 8 



In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: KHAFLMAHQFCVLSLAMQWSSCFQLVALPYLSL (SEQ ID 
NO:324). Polynucleotides encoding these polypeptides are also encompassed by the 
20 invention. 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

25 not limited to, immune disorders, such as neutropenia. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 

30 (e.g. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

35 The tissue distribution in neutrophils indicates that polynucleotides and 

polypeptides corresponding to this gene arc useful for the diagnosis, treatment, and/or 
prevention of immune disorders. Furthermore, this gene product may be involved in the 
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regulation of cytokine production, antigen presentation, or other processes that may 
also surest a usefulness in the treatment of cancer (e.g. by boosting .mmune 
responds). Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in .mmune functions. Therefore it may be also used as an 
5 agent for immunological disorders including arthritis, asthma, immunodelicency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
10 diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

injury demyelination, systemic lupus erythematosus, drug induced hemolyt.c anem.a, 
^, m ., n iH arthritis. Siren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
15 various cell types. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO- 1 8 and may have been publicly available prior to conception of the present 
20 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b where a is any integer between 1 to 979 of SEQ ID NO: 1 8. b is an integer of 15 to 
25 993, where both a and b correspond to the positions of nucleotide residues shown ,n 
SEQ ID NO: 18, and where b is greater than or equal to a + 14. 



30 



15 



FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

When tested against Jurkct and PC 12 cell lines, supernatants removed from 
cells containing this gene activated the GAS (gamma activating sequence) and EGR1 
(early growth response gene 1) promoter elements. Thus, it is likely that this gene 
activates T-cells and sensory neurons through the JAK-STAT and EGR1 signal 
transduction pathways. GAS is a promoter element found upstream of many genes 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, signal 
transduction pathway involved in the differentiation and proliferation of cells. 
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Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. EGR1 is a separate signal transduction pathway from Jak-STAT, genes 
containing the EGR1 promoter are induced in various tissues and cell types upon 
5 activation, leading the cells to undergo differentiation and proliferation. In specific 
embodiments, polypeptides of the invention comprise the following amino acid 
sequence: MRPLCVLLPWPCWQWGGLGSASPIRPQAPPGQAAHAVP 
LPRAQHLAQRSRQ (SEQ ID NO:325). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. The gene encoding the disclosed cDNA is 
10 believed to reside on chromosome 17. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 17. 

This gene is expressed primarily in breast, lymph nodes, spleen, and to a lesser 
extent, in liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. reproductive, immune, or hepatic disorders, particularly cancers of the 
breast, liver, and lymph system. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
20 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the breast, liver and lymph system, expression of this 
gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. reproductive, breast, immune hematopoietic, hepatic, and cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, breast milk. bile, serum, plasma, urine. 
25 synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
30 NO: 168 as residues: Pro-54 to Gly-67. 

The tissue distribution in breast and immune tissues combined with the detected 
EGR1 and GAS biological activity indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of cancers of the 
breast and lymph systems. Moreover, the GAS and EGR1 activity strongly indicates 
35 that the protein product of this gene may play an integral role in the regulation of cellular 
division, and may show utility in the diagnosis and treatment of cancer and other 
proliferative disorders. Similarly, such proliferative tissues rely on finely regulated 
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decisions involving cell differentiation anchor apoptosis. Thus this protein may also be 
involved in regulating apoptosis or tissue differentiation and, thus could be useful in 
cancer therapy. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

5 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 19 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

10 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 445 of SEQ ID NO: 19, b is an integer of 15 to 
459, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 19, and where b is greater than or equal to a + 14. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

In specific embodiments, polypeptides of the invention comprise the following 
20 amino acid sequence: ARGLRSPHGAAGVVRGDGGGKKGEDPYSPILFQ 
SERIPRLIYLPVISSEENS (SEQ ID NO:326). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in an LPS induced neutrophil cDNA library. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune system disorders, such as neutropenia. Similarly, polypeptides 
and antibodies directed to these polypeptides arc useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
30 disorders of the above tissues or cells, particularly of the immune system, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g.immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
35 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
immune disorders, for example, in ameliorating an abberant neutrophil reponse to 
infectious agents. Similarly, the expression of this gene product may suggest a role in 
5 regulating the proliferation; survival; differentiation; and/or activation of hematopoietic 
cell lineages, including blood stem cells. This gene product may also be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). Since the gene is expressed in cells of lymphoid origin, the natural gene 

10 product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 

15 transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 

20 progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

25 ID NO:20 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 

30 a-b, where a is any integer between 1 to 541 of SEQ ID NO:20, b is an integer of 15 to 
555, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:20, and where b is greater than or equal to a + 14. 



35 



BNSDOCID: cWO 9922243A1 I > 



WO 99/22243 




PCT/US98/22376 



21 



FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

This gene is expressed primarily in prostate cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive or immune system disorders, particularly prostate cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
1 0 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g.reproductive, prostate, and cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, seminal fluid, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
15 having such a disorder, relative to the standard gene expression level, i.e., the 

expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 170 as residues: Pro-14to Asp-25, Leu-51 to Val-63. 

20 The tissue distribution in prostate tissues indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
reproductive system disorders such as cancer, particularly prostate cancer. Similarly, 
the expression within prostate cancer tissue, a cellular source marked by proliferating 
cells, indicates that this protein may play a role in the regulation of cellular division, and 

25 may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders not limited to prostate tissue. Further, such tissues rely on decisions 
involving cell differentiation and/or apoptosis. Thus this protein may also be involved 
in apoptosis or tissue differentiation and could again be useful in cancer therapy. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 

30 marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:2 1 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

35 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or mote 
polynucleotides comprising a nucleotide sequence described by the general formula of 
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a-b, where a is any integer between 1 to 651 of SEQ ID NO:21, b is an integer of 15 to 
665, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:2 1 , and where b is greater than or equal to a + 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

The polynucleotide sequence of this gene may have a frame shift. Therefore the 
preferred signal peptide may reside in a frame other than the associated polynucleotides 
10 of this gene. In specific embodiments, polypeptides of the invention comprise the 
following amino acid sequence: 

KSLSCSFLFLAFWLRRMGQTMCVCVCVCVCVCVRTWVYLYEPVKF RSPLIYV 
NLPTS (SEQ ID NO:327), and/or KLGFTMLARLVSNSXTSGDLPSSASQNAGI 
KGMSYRAWPYSYFLIRKNKQT NKQTKTNPQLGENKHCRNLKVSWSKNYFL 
15 (SEQ ID NO:328). Polynucleotides encoding these polypeptides are also encompassed 
by the invention. 

This gene is expressed primarily in T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions which include, but are 

not limited to, immune system disorders, particularly immunodeficiencies such as lupus 
and AIDS, or inflammatory disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

25 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 

30 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of a 
variety of immune system disorders. Moreover, this gene product may play a role in 

35 regulating the proliferation; survival; differentiation; and/or activation of hematopoietic 
cell lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that mav 
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also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

5 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity, immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

10 injury, demyelination, systemic lupus erythematosis. drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 

15 utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:22 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

20 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 763 of SEQ ID NO:22. b is an integer of 15 to 
777. where both a and b correspond to the positions of nucleotide residues shown in 

25 SEQ ID NO:22, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

30 When tested against Jurket and U937 cell lines, supernatants removed from 

cells containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates T-cells and promyelocyte cells 
through the JAK-STAT signal transduction pathway. GAS is a promoter element found 
upstream of many genes which are involved in the Jak-STAT pathway. The Jak-STAT 

35 pathway is a large, signal transduction pathway involved in the differentiation and 

proliferation of cells. Therefore, activation of the Jak-STAT pathway, reflected by the 
binding of the GAS element, can be used to indicate proteins involved in the 
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proliferation and differentiation of cells. In specif ic embodiments, polypeptides of the 
invention comprise the following amino acid sequence: 
ERGQGGSSRNVAGSDLVFPAVFVSXLC (SEQ ID NO:329), 

GSPQGPSVALGSRQCWSRPLRRGGRGAAVEMWRGPTWCFRPSLCLCCVCGV 
5 SFGLYVPHGFSLSMCVSAP GSAWLSLVYSICLARGSMSXRXSSRXSLV 

ASGASVLLVCFWVXADPGVGVSVPRAXVSGLWWCVSPSACLXLAPTKPPP 
XLSFSLSIFPFSSNPSK (SEQ ID NO:330), and/or TIASLQPTALNHLIWRGW 
KRKGRLRERKRGXGGAWLGPXRGRQMDSHTTRDQRQXLGEQRHPLLGLXA 
PRSKPTKQMPQMQPGXPEKKXXLTWNHGLDRWNTQGTARQSLGQK 

10 HTWRD (SEQ ID NO:33 1). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily in adipose tissue and the brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic or neural disorders, particularly obesity, and 
neurodegenerative or central nervous system disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

20 of the above tissues or cells, particularly of the brain and central nervous system, 

expression of this gene at significantly higher or lower levels may be detected m certain 
tissues or cell types (e.g. adipose, neural, immune, and cancerous and wounded tissues) 
or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

25 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in adipose and neural tissues, combined with the detected 
GAS biological activity indicates that polynucleotides and polypeptides corresponding 
to this gene are useful for diagnosis and treatment of obesity and disorders of the brain 

30 and central nervous system. Similarly, the protein product of this gene may be useful 
for the detection/treatment of neurodegenerative disease states, behavioural disorders, 
or mflamatory conditions such as Alzheimers Disease, Parkinsons Disease, 
Huntingtons Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 

35 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, mania, 
dementia, paranoia, obsessive compulsive disorder, panic disorder, learning 
disabilities, ALS, psychoses . autism, and altered bahaviors. including disorders in 
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feeding, sleep patterns, balance, and preception. In addition, elevated expression of this 
gene product in regions of the brain indicates that it plays a role in normal neural 
function. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
5 survival. Moreover, the gene or gene product may also play a role in the treatment 
and/or detection of developmental disorders associated with the developing embryo, 
sexually-linked disorders, or disorders of the cardiovascular system. In addition, the 
protein product of this gene may also be beneficial in detecting, treating, or preventing 
neural disorders which occur secondary to aberrant fatty acid metabolism in neural 
10 tissues, such as for aberrations in myelin sheath development, or associated 

autoimmune disorders of neural tissue or the overlying integument. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
15 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:23 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
20 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 526 of SEQ ID NO:23, b is an integer of 15 to 
540, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:23, and where b is greater than or equal to a + 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ARGPGTEGCEPWLQLQDRRER (SEQ ID NO:332), and/or 
30 MSSGTNSFFTLMALNSPTGDSGSRITVSPPRVHPVKSGRGRASDLLLTRFLAPR 
SALWS (SEQ ID NO:333). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily in a cDNA library from IL-1 and LPS induced 

neutrophils. 

35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but arc 
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not limited to, immune system disorders, such as neutropenia. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell typc(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression of 
5 this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. immune, cancerous and wounded tissues) or bodily fluids (e.g. lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

10 individual not having the disorder. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
conditions where lymphocytes show abberant response to an infectious agent. 
Similarly, this gene product may play a role in regulating the proliferation; survival; 

15 differentiation; and/or activation of hematopoietic cell lineages, including blood stem 
cells. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). Since the gene is expressed 
in cells of lymphoid origin, the natural gene product may be involved in immune 

20 functions- Therefore it may be also used as an agent for immunological disorders 
including arthritis, asthma, immunodeficiency diseases such as AIDS, leukemia, 
rheumatoid arthritis, granulomatous disease, inflammatory bowel disease, sepsis, acne, 
neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated 
cytotoxicity; immune reactions to transplanted organs and tissues, such as host-versus- 

25 graft and graft-versus-host diseases, or autoimmunity disorders, such as autoimmune 
infertility, lense tissue injury, demyelination, systemic lupus erythematosis, drug 
induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, scleroderma and 
tissues. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 

30 differentiation and/or proliferation of various cell types. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or immunotherapy 
targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

35 ID NO:24 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
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Accordingly, preferably excluded from the present invention arc one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 470 of SEQ ID NO:24, b is an integer of 1 5 to 
484, where both a and b correspond to the positions of nucleotide residues shown in 
5 SEQ ID NO:24, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

10 This gene is expressed primarily in ovaries, tonsils, and CD34 positive bone 

marrow stem cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
15 not limited to, reproductive, immune, developmental, or hematopoietic disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and reproductive systems, expression of this gene at significantly higher or 
20 lower levels may be detected in certain tissues or cell types (e.g. reproductive, ovarian, 
immune, tonsil, umbilical, developmental, and cancerous and wounded tissues) or 
bodily fluids (e.g. lymph, amniotic fluid, serum, plasma, urine, synovial fluid and 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e.. the expression level in 
25 healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in ovarian and tonsil tissue indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment ol 
immune and reproductive system disorders. Similarly, expression of this gene product 
in tonsils indicates a role in regulating the proliferation; survival; differentiation; and/or 
30 activation of hematopoietic cell lineages, including blood stem cells. This gene product 
may be involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 
the natural gene product may be involved in immune functions. Therefore it may be also 
35 used as an agent for immunological disorders including arthritis, asthma, 
immunodeficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
granulomatous disease, inflammatory bowel disease, sepsis, acne, neutropenia. 
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neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated cytotoxicity; immune 
reactions to transplanted organs and tissues, such as host-versus-graft and graft-versus- 
host diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelmation, systemic lupus erythematosis, drug induced hemolytic anemia, 
5 rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:25 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

15 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 693 of SEQ ID NO:25, b is an integer of 15 to 
707, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:25, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

When tested against U937 cell lines, supernatants removed from cells 
25 containing this gene activated the GAS (gamma activating sequence) promoter element. 
Thus, it is likely that this gene activates promyelocyctic cells through the JAK-STAT 
signal transduction pathway. GAS is a promoter element found upstream of many 
genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
30 Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 

element, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HEYHLLSSRHILGSVLRLDVC SALWS (SEQ ID NO:334). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
35 This gene is expressed primarily in IL-1 and LPS induced neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 



BNSDOCID: <WO 9922243A1 I 



WO 99/22243 




PCTYUS98/22376 



29 



biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders, such as neutropenia. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
5 of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
10 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in immune cells combined with the detected GAS 
biological activity indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for the diagnosis, treatment, and/or prevention of immune disorders. 
15 Specifically, the expression of this gene product in neutrophils indicates a role in 

regulating the proliferation; survival; differentiation; and/or activation of hematopoietic 
cell lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
20 responses). Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
25 hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 

transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lensc tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
30 product may have commercial utility in the expansion of stem cells and committed 

progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
35 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:26 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
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scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 779 of SEQ ID NO:26, b is an integer of 15 to 
793, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:26, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

This gene is expressed primarily in the spinal cord. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, any of a variety of nervous system and neuromuscular disorders, 
particularly amyotropic lateral sclerosis, musculuar dystrophy, and inherited and non- 
inherited forms of chorea. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system and neuromuscular systems, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. neural, neuromuscular, and cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in spinal cord tissue indicates that this gene could be used 
for the treatment of spinal cord and related injuries. The protein product of this gene 
could be injected into the spinal cord to promote or control growth following injuring or 
degeneration. Alternatively cells expressing this gene could be injected or transferred 
into the spinal cord by other means as a treatment promoting the regulation of growth 
following spinal cord injury or degeneration. Moreover, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states, behavioural disorders, or inflamatory conditions such 
as Alzheimers Disease, Parkinsons Disease. Huntingtons Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction. 
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aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, panic disorder, learning disabilities, ALS, psychoses , autism, 
and altered bahaviors, including disorders in feeding, sleep patterns, balance, and 
preception. In addition, elevated expression of this gene product in regions of the brain 

5 indicates that it plays a role in normal neural function. Potentially, this gene product is 
involved in synapse formation, neurotransmission, learning, cognition, homeostasis, or 
neuronal differentiation or survival. Moreover, the gene or gene product may also play a 
role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, sexually-linked disorders, or disorders of the cardiovascular 

10 system. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:27 and may have been publicly available prior to conception of the present 

15 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 624 of SEQ ID NO: 27, b is an integer of 15 to 

20 638, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:27, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

25 

When tested against U937 cell lines, supernatants removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter element. 
Thus, it is likely that this gene activates promyelocytic cells through the JAK-STAT 
signal transduction pathway. GAS is a promoter element found upstream of many 
30 genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. In specific embodiments, polypeptides of the invention comprise the following 

35 amino acid sequence: 

FILFILEYDMLWKSLYTNSSAYGY V1AS YTCLLGIKLLVKQKKXKKKTRGGAR 
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X PIRPXVESYYKSXAVVLQRRGLGKNLGG (SEQ ID NO:335). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
This gene is expressed primarily in the adrenal gland. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, endocrine disorders, particularly disorders of the adrenal gland. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 

10 type(s). For a number of disorders of the above tissues or cells, particularly of the 
adrenal gland, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g.endocrine, adrenal, and cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such a 

15 disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in adrenal tissue, combined with the detected GAS 
biological activity indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for the detection, treatment, and/or prevention of various endocrine 

20 disorders and cancers, particularly Addison's disease, Cushing's Syndrome, and 
disorders and/or cancers of the pancrease (e.g. diabetes mellitus), adrenal cortex, 
ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Protein, as well as, antibodies directed against the protein may show utility as a 

25 tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:28 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

30 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 514 of SEQ ID NO:28, b is an integer of 15 to 
528, where both a and b correspond to the positions of nucleotide residues shown in 

35 SEQ ID NO:28, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

This gene is expressed primarily in the placenta. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive system disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the reproductive system, expression of this 
^ene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g.placental, reproductive, and cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 

15 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 178 as residues: His- 15 to Trp-20, Pro-48 to Ala-54. 

20 The tissue distribution in placental tissue indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
reproductive disorders. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:29 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

30 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 905 of SEQ ID NO:29, b is an integer of 15 to 
919, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:29, and where b is greater than or equal to a + 14. 



BNSDOCiD -WC 99222J3A1 l -> 



WO 99/22243 




PCT/US98/22376 



34 



FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

The translation product of this gene shares sequence homology with human 
erythrocyte membrane anion-transport protein which is thought to be important in 
5 autoimmune diseases. Furthermore, the translation product of this gene also has 
homology to a human gnllPIDId 1026838 (ABO12130) sodium bicarbonate 
cotransporter2 which is thought to be important in maintaining cellular homoestasis. 
Contact of cells with supernatant expressing the product of this gene was found to 
increase the permeability of the plasma membrane of enterocy tes and renal mesangial 

10 cells to calcium. Thus it is likely that the product of this gene is involved in a signal 

transduction pathway that is initiated when the product binds a receptor on the surface 
of the enterocytes and renal mesangial cells. Thus, polynucleotides and polypeptides 
have uses which include, but are not limited to, activating cellular processes within 
enterocytes and renal mesangial cells. In specific embodiments, polypeptides of the 

15 invention comprise the following amino acid sequence: 

RVSSHLFRLFGGLILDIKRKAPFFLSDFKDALSLQCLASILFLYCACMSPVITFG 
GLLGEATEG RIVSTKIGSGQAFSSSEASVCMHLSHYSYFYLKSLPTA (SEQ ID 
NO:336), FRLFGGLILDIKRKAPFFLSDFKD (SEQ ID NO:337), FLYCACMSPV 
ITFGGLLGEATEG (SEQ ID NO:338), and/or SSSEASVCMHLSHYSYFYLKSL 

20 (SEQ ID NO:339). Polynucleotides encoding these polypeptides are also encompassed 
by the invention. The gene encoding the disclosed cDNA is believed to reside on 
chromosome 3. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 3. 

This gene is expressed primarily in human testes tumor. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune or reproductive disorders, particularly autoimmune diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

30 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. immune, reproductive, testicular, and 
cancerous and wounded tissues) or bodily fluids (e.g. lymph, seminal fluid, serum, 

35 plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
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level i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

The tissue distribution in testis, the homology to an erythrocyte membrane 
antion-transport protein, in addition to, the detected calcium flux biological activity 
5 indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
the treatment of autoimmune diseases and other immune diseases such as cancer, 
particularly in, but not limited to, testicular tissue. Similarly, the translation product of 
this gene may be important in maintaining normal, cellular homeostasis. Therefore, the 
protein, as well as antibodies directed to the invention, is beneficial as a therapeutic in 

10 order to ameliorate conditions related to aberrant cellular pH regulation (for example, 
use antibodies to decrease the presence of the protein, or possibly in gene therapy 
applications in order to replace a defective form, or alternatively, increase the 
expression of either the endogenous or modified form of the invention). Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

15 and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:30 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

20 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 850 of SEQ ID NO:30, b is an integer of 15 to 
864, where both a and b correspond to the positions of nucleotide residues shown in 

25 SEQ ID NO:30, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

30 This gene is expressed primarily in the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural disorders, which include, but are not limited to, disorders of the 

35 brain and central nervous system, such as neurodegenerative conditions and/or 

depression. Similarly, polypeptides and antibodies directed to these polypeptides arc 
useful in providing immunological probes for differential identification of the tissue(s) 
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or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the brain and central nervous system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g. neural, cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 
5 and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 180 as residues: His- 13 to Leu- 18. 

10 The tissue distribution in neural tissue indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
disorders of the brain and central nervous system. Moreover, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states, behavioural disorders, or inflamatory conditions such 

15 as Alzheimers Disease, Parkinsons Disease, Huntingtons Disease. Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, panic disorder, learning disabilities, ALS, psychoses , autism, 

20 and altered bahaviors, including disorders in feeding, sleep patterns, balance, and 

preception. In addition, elevated expression of this gene product in regions of the brain 
indicates that it plays a role in normal neural function. Potentially, this gene product is 
involved in synapse formation, neurotransmission, learning, cognition, homeostasis, or 
neuronal differentiation or survival. Moreover, the gene or gene product may also play a 

25 role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, sexually-linked disorders, or disorders of the cardiovascular 
system. Protein, as well as. antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

30 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:3 1 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

35 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 905 of SEQ ID NO:3 1, b is an integer of 15 to 
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919, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:3 1, and where b is greater than or equal to a + 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

PCLQVIGIDFCRLLLMCLVLKRNLTVPFSSYSPLKTITCITSEQIAVVSNFFRQKL 
10 GVRAK FFQGACLHTSKVVICLNLPIISIQRADIRMWWLVVNTPYARGVNN 

(SEQ ID NO:340). Polynucleotides encoding these polypeptides are also encompassed 
by the invention. 

This gene is expressed primarily in the spinal cord. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, any of a variety of nervous system and neuromuscular disorders 
including, but not limited to, amyotropic lateral sclerosis, musculuar dystrophy, and 
inherited and non-inherited forms of chorea. Similarly, polypeptides and antibodies 

20 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the central nervous system and neuromuscular 
systems, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. neural, cancerous and wounded tissues) or bodily 

25 fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in spinal cord indicates that this gene could be used for 

30 the treatment of spinal cord injuries. The protein product of this gene could be injected 
into the spinal cord to promote or control growth following injuring or degeneration. 
Alternatively cells expressing this gene could be injected or transferred into the spinal 
cord by other means as a treatment promoting the growth or regulation of growth 
following spinal cord injury or degeneration. This gene may also be useful for the 

35 detection/treatment of neurodegenerative disease states, behavioural disorders, or 

inMamatory conditions such as Alzheimers Disease, Parkinsons Disease, Huntingtons 
Disease, Tourette Syndrome, meningitis, encephalitis, demyehnattng diseases. 



BNSDOCID <WO 9922243A1 I > 



WO 99/22243 




PCT/US98/22376 



38 



peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal cord 
injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, mania, 
dementia, paranoia, obsessive compulsive disorder, panic disorder, learning 
disabilities, ALS, psychoses , autism, and altered bahaviors, including disorders in 
feeding, sleep patterns, balance, and preception. In addition, elevated expression of this 
gene product in regions of the brain indicates that it plays a role in normal neural 
function. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Moreover, the gene or gene product may also play a role in the treatment 
and/or detection of developmental disorders associated with the developing embryo, 
sexually-linked disorders, or disorders of the cardiovascular system. Protein, as well 
as, antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 32 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 942 of SEQ ID NO:32. b is an integer of 15 to 
956, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:32, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: VVSVCVLETGQLGPAALCRSV (SEQ ID NO:341). 
Polynucleotides encoding these polypeptides arc also encompassed by the invention. 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue( s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune disorders, which include, nut are not limited to, inflammatory 
diseases or neutropenia. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
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of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g.immune, cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 
5 and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treament of the 

10 inflammatory conditions. Moreover, This gene product may be involved in the 

regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 

15 agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 

20 diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 

25 various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 33 and may have been publicly available prior to conception of the present 

30 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 552 of SEQ ID NO:33, b is an integer of 15 to 

35 566, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:33, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: 

NISVHGFPVPCLRQRLQGPCHPKCCPHXISSGKPRSSFSPSSYHCKFSRNATLL 
VVPNIFSYMQSSFLIPQTSKYYILXPYAXTXRPIKXIFKQAKQ (SEQ ID NO:342), 
IYNDMMMEKKKTEVYQKRXSGDNTWGGKGLVAFVSSMEQGIHVQRCFIANL 

10 KFSSPGV (SEQ ID NO: 343), YDDGEKEDRGLPEEMXWGQHLGWQGPCSL 

CLKHGTGNPCTEMFYCQFKIFISWCLIPLVFARLGDFRDRPGWIFSWRYHLKH 
TVWGGYNIIML (SEQ ID NO: 344), TPGDENFKLAIKHLCTWIPCS (SEQ ID NO: 
345), IRHEIFLTIESFCPS APRGEDDDNLLRTSRVPDI (SEQ ID NO:346), IRGSIP 
GHKKMHLS FNVAAQWSLLKPLVLREEGALFLTHDQLESKNSWTLSIGPRV 

15 PYTYVVVTWSSALWDLPNQPLAGRKESGGSYGPISVTQSPHQAALKWFAKK 
KGKQSHSTVQLANILHVFXAPDXYHFVNTSLQLFLEYTVMCMLCENK QKT 
LGR (SEQ ID NO:347), EPEVTQVXSXELTFQ PRKAGAKVTAGKSHHQVIHWE 
FEIMLSSYSTDVPLWFLKFFSSNLPQTYFPHSGVKKWGSCFSLPWRDSPPLT 
FISLLSSHLTTFHLYHLHHGIICLGFSVYFHRAYTSLCILETAVGSY (SEQ ID 

20 NO:348), WSLLKPLVLREEGALFLTHDQLESK (SEQ ID NO:349), WFAKKK 

GKQSHSTVQLANILHV (SEQ ID NO:350). AGKSHHQVIHWEFEIMLSSYSTDVP 
(SEQ ID NO:351), and/or HGIICLGFSVYFHRAYTSLCILETAV (SEQ ID NO:352). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
This gene is expressed primarily in smooth muscle. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, muscular or vascular disorders, which include, but are not limited to 
defective organ innervation; deficiencies in neuronal survival; peristaltic abnormalities; 

30 digestive disorders; perturbations of the vasculature. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the smooth muscle, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 

35 (e.g., cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in smooth muscle tissue indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the diagnosis and/or 
5 treatment of disorders that result from failures of normal smooth muscle function. For 
example, this gene product may represent a soluble factor produced by smooth muscle 
that regulates the innervation of organs or regulates the survival of neighboring 
neurons. Likewise, it may be involved in controlling the digestive process, and such 
actions as peristalsis. Similarly, it may be involved in controlling the vasculature in 

10 areas where smooth muscle surrounds the endothelium of blood vessels. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

15 ID NO:34 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 

20 a-b, where a is any integer between 1 to 1550 of SEQ ID NO:34, b is an integer of 15 
to 1564, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:34, and where b is greater than or equal to a -f 14. 



25 FEATURES OF PROTEIN ENCODED BY GENE NO: 25 



In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: KRLTINARVHLWTLKSVPL (SEQ ID NO:353), EYVFNMX 
XYSKSRAISPLSGPYTPRGTTPLPIIPEPGARQRDHPAS LKYAKIIQTKLFAL 
30 PYPKETSMKAVA (SEQ ID NO:354), and/or ETVPPRSSQFLKITXGPARSMSLIX 
XAIQNPEPYLLYLALIPQEALLLYLSSQSQVPGNETTPPV (SEQ ID NO:355). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune system disorders, which include, but are not limited to lupus. 
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inflammatory conditions, and immunodeficiencies such as AIDS. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell typc(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system, 
5 expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. immune, hematopoietic, and cancerous and wounded tissues) 
or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

10 fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 184 as residues: Ser-21 to Thr-34. Thr-38 to Glu-43. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of a 

15 variety of immune system disorders. Expression of this gene product in T-cells 

indicates a role in regulating the proliferation: survival; differentiation; and/or activation 
of hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 

20 boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 

the natural gene product may be involved in immune functions. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, 
immunodeficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
granulomatous disease, inflammatory bowel disease, sepsis, acne, neutropenia, 

25 neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated cytotoxicity; immune 
reactions to transplanted organs and tissues, such as host-versus-graft and graft-versus- 
host diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. In addition, this gene 

30 product may have commercial utility in the expansion of stem cells and committed 

progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

35 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:35 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
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scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1021 of SEQ ID NO:35, b is an integer of 15 
5 to 1035, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:35, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

10 

The translation product of this gene was determined to have homology to the 
human TB3089A protein which is thought to play an important role in tumor 
suppression (See Genbank Accession No.gii304i 877 (AF027734)).In specific 
embodiments, polypeptides of the invention comprise the following amino acid 

1 5 sequence: NEVSFSLSLGFSPREFARWKVNNLAL ERKDFFSLPLPLAPEFIRNI 
RLLGRRPNLQQVTENLIKKYGTHFLLSATLGGKQHHNPKLIGCQTIGNNV 
KTRVA (SEQ ID NO:356), VPYFLIRFSVTCCRLGLLPRRRMFRIN SGARGNG 
KLKKSFLSRAK LFTFQRANSLGEKPRDKEKLTSFQSKRHKI (SEQ ID NO:357). 
and/or EMSAVLFNQIFCNLLQIGSPSKEANVPDKLWGKRQWQTEEVLPFQSQV 

20 VHLPTGKLPGGKAKG (SEQ ID NO:358). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in human fibrosarcoma. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders afflicting endothelial, muscular, and extracellular matrix 
tissues, which include, but are not limited to fibrosarcomas and bladder cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 

30 type(s). For a number of disorders of the above tissues or cells, particularly of the 
integumentary system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g.endothelial. urogenitahrenal, 
muscular, and cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 

35 from an individual having such a disorder, relative to the standard gene expression 

level, i.e., the expression level in healthy tissue or bodily fluid irorn an individual not 
having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 185 as residues: Pro-49 to Asp-68. 

The tissue distribution in human fibrosarcoma indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
5 various cancers, particularly fibrosarcomas and fibroids. Moreover, the expression 

within cellular sources marked by proliferating cells indicates that this protein may play 
a role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. Similarly, developmental tissues 
rely on decisions involving cell differentiation and/or apoptosis in pattern formation. 
10 Thus this protein may also be involved in apoptosis or tissue differentiation and could 
again be useful in cancer therapy. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker ancl/or immunotherapy targets for the above 
listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
15 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:36 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
20 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 606 of SEQ ID NO:36, b is an integer of 15 to 
620, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:36, and where b is greater than or equal to a 4- 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 



This gene is expressed primarily in human tonsil. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
30 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune or hematopoietic disorders, which include, but are not limited to 
inflammation and infectious diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
35 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
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(e.g. immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. 

The tissue distribution in tonsils indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis of inflammation and infectious 
diseases. Moreover, this gene product may play a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 

10 blood stem cells. This gene product may be involved in the regulation of cytokine 

production, antigen presentation, or other processes that may also suggest a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). Since the gene is 
expressed in cells of lymphoid origin, the natural gene product may be involved in 
immune functions. Therefore it may be also used as an agent for immunological 

15 disorders including arthritis, asthma, immunodeficiency diseases such as AIDS, 

leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel disease, 
sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell 
mediated cytotoxicity; immune reactions to transplanted organs and tissues, such as 
host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 

20 autoimmune infertility, lense tissue injury, demyelination, systemic lupus 

erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, 
scleroderma and tissues. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Protein, as well as, 

25 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:37 and may have been publicly available prior to conception of the present 

30 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 959 of SEQ ID NO:37, b is an integer of 15 to 

35 973, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:37, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: 

HYHGSGFLIKEFGSFLSLLCMLSCPYVFCHGMLEQEVPSSVVSPSTLDF PTSR 
TVNKFLFKLPSLWYSVIATQNGLKQKJRETFLFVQFSQMPRWHKLE (SEQ ID 
NO:359). Polynucleotides encoding these polypeptides are also encompassed by the 

10 invention. 

This gene is expressed primarily in adipose and brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

15 not limited to. metabolic or neural conditions, which include but are not limited to 

obesity and disorders of the brain and central nervous system. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the central nervous system, 

20 expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. neural, metabolic tissues, and cancerous and wounded tissues) 
or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 

25 fluid from an individual not having the disorder. 

The tissue distribution in neural and adipose tissue indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
obesity and disorders of brain and central system. Moreover, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 

30 neurodegenerative disease states, behavioural disorders, or inflamatory conditions such 
as Alzheimers Disease, Parkinsons Disease, Huntingtons Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 

35 compulsive disorder, panic disorder, learning disabilities, ALS, psychoses , autism, 
and altered bahaviors, including disorders in feeding, sleep patterns, balance, and 
prcception. In addition, elevated expression of this gene product in regions of the brain 
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indicates that it plays a role in normal neural function. Potentially, this gene product is 
involved in synapse formation, neurotransmission, learning, cognition, homeostasis, or 
neuronal differentiation or survival. Moreover, the gene or gene product may also play 
a role in the treatment and/or detection of developmental disorders associated with the 
5 developing embryo, sexually-linked disorders, or disorders of the cardiovascular 
system. In addition, considering the expression within both adipose tissue and brain 
indicates that the protein may be benefical cither as a target for gene therapy, or as a 
novel therapeutic to ameliorate conditions affecting myelin sheath development in 
neurons, or other disorders involving neural tissue which occur secondary to aberrant 
10 fatty-acid metabolism. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

15 ID NO:38 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 

20 a-b, where a is any integer between 1 to 824 of SEQ ID NO:38, b is an integer of 15 to 
838, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:38, and where b is greater than or equal to a + 14. 



25 FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

The gene encoding the disclosed cDNA is thought to reside on chromosome 1 1. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 11. One embodiment of this gene comprises polypeptides of 

30 the following amino acid sequence: 

FCKHNGSKNVFSTFRTPAVLFTGIVALYIASGLTGFIGLEVVAQLFNC (SEQ ID 
NO:360). An additional embodiment is the polynucleotides encoding the polypeptides. 

This gene is expressed primarily in suppressor T cells, endothelial cells, 
dendritic cells, and infant brain. 

35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to. immune system disorders related to abnormal activation of T cells. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
5 immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. hematopoietic, developmental, neural, 
immune, endothelial, and cancerous and wounded tissues) or bodily fluids (e.g. lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 

10 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 188 as residues: Tyr-14 to Leu-24. Pro-59 to Gin-66. 

The tissue distribution in immune cells indicates that polynucleotides and 

15 polypeptides corresponding to this gene are useful for treating disorders of the immune 
system related to altered activation of T cells. Furthermore, this gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of immune disorders. 
Since the gene is expressed in cells of lymphoid origin, the gene or protein, as well as, 

20 antibodies directed against the protein may show utility as a tumor marker and/or 

immunotherapy targets for the above listed tissues. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immune deficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory bowel disease, 
sepsis, acne, and psoriasis. In addition, this gene product may have commercial utility 

25 in the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

30 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 39 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

35 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 593 of SEQ ID NO:39, b is an integer of 15 to 
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607, where both a and b eorrespond to the positions of nucleotide residues shown in 
SEQ ID NO:39, and where b is greater than or equal to a + 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

This gene is expressed primarily in the fetus and in tumor cell types. 
Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases of rapidly growing tissues such as cancers. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of rapdily growing 

15 tissues, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. fetal, and cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

20 fluid from an individual not having the disorder. 

The tissue distribution of this gene primarily in the developing fetus indicates a 
role in the treatment and/or detection of developmental disorders and growth defects. In 
addition, expression in tumor cell types indicates a role in the detection and/or treatment 
of tumors. Furthermore, expression within fetal tissue and other cellular sources 

25 marked by proliferating cells indicates that this protein may play a role in the regulation 
of cellular division, and may show utility in the diagnosis and treatment of cancer and 
other proliferative disorders. Similarly, embryonic development also involves decisions 
involving cell differentiation and/or apoptosis in pattern formation. Thus this protein 
may also be involved in apoptosis or tissue differentiation and could again be useful in 

30 cancer therapy. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:4() and may have been publicly available prior to conception of the present 

35 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
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polynucleotides comprising a nucleotide sequence described by the genera] formula of 
a-b, where a is any integer between 1 to 868 of SEQ ID NO:40. b is an integer of 15 to 
882, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:40, and where b is greater than or equal to a + 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

This gene is expressed primarily in salivary gland, and to a lesser extent, in 
10 other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, digestive and immune disorders. Similarly, polypeptides and antibodies 
15 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the salivary gland and other glands of the 
exocrine system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. exocrine, digestive, cancerous and 
20 wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 

and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
25 NO: 1 90 as residues: Glu-25 to Gly-3 1 , Tyr-62 to Thr-68. 

The tissue distribution in salivary gland tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
digestive and immune system disorders. Protein, as well as, antibodies directed against 
the protein may show utility as a tumor marker and/or immunotherapy targets for the 
30 above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:41 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
35 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
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a-b, where a is any integer between 1 to 945 of SEQ ID NO:41, b is an integer of 15 to 
959, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:41, and where b is greater than or equal to a + 14. 



5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

The gene encoding the disclosed cDNA is thought to reside on chromosome 12. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
10 analysis for chromosome 12. 

This gene is expressed primarily in brain tissue of adults, as well as infants. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
15 not limited to. neurodegenerative and behavioural disorders. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell typc(s). For a number of 
disorders of the above tissues or cells, particularly of the central and peripheral nervous 
system, expression of this gene at significantly higher or lower levels may be detected 
20 in certain tissues or cell types (e.g. brain, developmental, and cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, amniotic fluid, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
25 Preferred epitopes include those comprising a sequence shown in SEQ ID 

NO: 191 as residues: Ser-16 to Val-33. 

The tissue distribution in neural tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states and behavioural disorders such as Alzheimers Disease, 
30 Parkinsons Disease, Huntintons Disease, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder and panic disorder. Furthermore, expression of this 
gene product within the brain indicates that it may be involved in neuronal survival: 
synapse formation; conductance; neural differentiation, etc. Such involvement may 
impact many processes, such as learning and cognition. Protein, as well as, antibodies 
35 directed against the protein may show utility as a tissue-specific marker and/or 
immunotherapy target for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:42 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 861 of SEQ ID NO;42, b is an integer of 15 to 
875, where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO:42, and where b is greater than or equal to a + 14. 



Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases affecting the synovial lining including arthritis and autoimmune 

20 disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s ). For a number of disorders of the above tissues or cells, particularly of 
the musculoskeletal system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g. endothelial, skeletal, and 

25 cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, 

synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for use as a factor that may protect against 
articular damage or promote growth of the cells in articulating joints. Furthermore, the 
expression of this gene product in synovium would suggest a role in the detection and 
treatment of disorders and conditions affecting the skeletal system, in particular 

35 osteoporosis as well as disorders afflicting connective tissues (e.g. arthritis, trauma, 

tendonitis, chrondomalacia and inflammation), such as in the diagnosis or treatment of 
various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
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This gene is expressed primarily in the synovium. 
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dermatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias (le. spondyloepiphyseal dysplasia 
congenita, familial osteoarthritis, Atelosteogenesis type II. metaphyseal 
chondrodysplasia type Schmid). Protein, as well as. antibodies directed against the 
5 protein may show utility as a tumor marker and/or immunotherapy targets for the above 
listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:43 and may have been publicly available prior to conception of the present 

10 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b. where a is any integer between 1 to 616 of SEQ ID NO:43, b is an integer of 15 to 

15 630, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:43, and where b is greater than or equal to a + 14. 
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When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells through the Jak-STAT signal transduction pathway. The gamma 
activating sequence (GAS) is a promoter element found upstream of many genes which 
25 are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, signal 
transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. 

30 This gene is expressed primarily in B-cell lymphoma cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune disorders, such as diseases of B-cell lineage including 

35 lymphomas lymphoblastic leukemias. myelomas and hairy cell leukemia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue! s) or cell type(s). For a 
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number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. immune, cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
5 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 193 as residues: Lys-82 to Pro-90. 

10 The tissue distribution and biological activity indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for for the treatment and or diagnosis 
of diseases of B-cell lineage including cancer. This factor may be useful in the terminal 
differentiation of malignant cells or may act as a growth factor for B-cell proliferation or 
differentiation, which is supported by the biological assay data. Protein, as well as, 

1 5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:44 and may have been publicly available prior to conception of the present 

20 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b. where a is any integer between 1 to 557 of SEQ ID NO:44, b is an integer of 15 to 

25 571, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:44, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

30 

When tested against U937 Myeloid cell lines, supcrnatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells through the Jak-STAT signal transduction pathway. The gamma 
activating sequence (GAS) is a promoter element found upstream of many genes which 
35 are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, signal 
transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
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clement, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. 

This gene is expressed primarily in osteoclastoma derived stromal cells, 
placenta, pancreas and several tumor derived cells and to a lesser extent in brain, 

5 melanocytes, dendritic cells, and several other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. tumors of the pancreas, uterus, ovary, bone, or adrenal gland. Similarly, 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the reproductive 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. placenta, pancreas, cancerous and wounded tissues) 

15 or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

20 corresponding to this gene are useful for treating or diagnosing tumors of the 

reproductive organs, pancreas, or bone marrow. Furthermore, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of various endocrine disorders and cancers, particularly Addison's disease, 
Cushing's Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes 

25 mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid 
(e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , 
hypothalamus, and testes. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:45 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

35 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 916 of SEQ ID NO:45. b is an integer of 15 to 
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930, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:45, and where b is greater than or equal to a + 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

When tested against K562 leukemia cell lines, supernatants removed from cells 
containing this gene activated the ISRE assay. Thus, it is likely that this gene activates 
leukemia cells through the Jak-STAT signal transduction pathway. The interferon- 

10 sensitive response element is a promoter element found upstream of many genes which 
are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, signal 
transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the ISRE 
element, can be used to indicate proteins involved in the proliferation and differentiation 

15 of cells. 

This gene is expressed primarily in kidney and to a lesser extent in brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

20 not limited to. renal and nervous system disorders. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue! s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the renal and nervous systems, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues or 

25 cell types (e.g. renal, urogenital, endocrine, and cancerous and wounded tissues) or 
bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

30 Preferred epitopes include those comprising a sequence shown in SEQ ID 

NO: 195 as residues: Lys-1 17 to Lys-126. 

The tissue distribution of this gene in kidney tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
and/or detection of renal disorders including kidney failure and Wilms Tumor in 

35 addition to the detection/treatment of neurodegenerative disease states and behavioural 
disorders such as Alzheimers Disease, Parkinsons Disease, Huntintons Disease, 
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schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder and panic 
disorder. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
5 ID NO:46 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
10 a-b, where a is any integer between 1 to 423 of SEQ ID NO:46. b is an integer of 15 to 
437, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:46. and where b is greater than or equal to a + 14. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

One embodiment of this gene comprises polypeptides of the following amino 
acid sequence: 

MPKPGAATQRTLLCLPRLHPASGPPLPXAGPLRGLRQLPALPVPAASCRRRPAP 
20 RLCAAGPCTVGPAASPHAPPHGCPPPASLAHV AHRQSVSGTVCLGLRDGHV 
RGGCAAVRGXAALPWDAAAAGPDHMGVGSGPALL (SEQ ID NO:361). An 
additional embodiment is the polynucleotides encoding these polypeptides. 

This gene is expressed primarily in pituitary and to a lesser extent in thymus and 

breast. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. metabolic and immune disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

30 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types or cell types (e.g. thymus, cancerous and wounded tissues) or 
bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 

35 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution of this gene indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and/or detection of 
endocrine, metabolic, and immune disorders including growth and developmental 
defects, in addition to the treatment or detection of immune or hematopoietic disorders 
5 including arthritis, asthma, immunodeficiency diseases and leukemia. Protein, as well 
as, antibodies directed against the protein may show utility as a tissue-specific marker 
and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

10 ID NO:47 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 

15 a-b, where a is any integer between 1 to 1010 of SEQ ID NO:47, b is an integer of 15 
to 1024, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:47, and where b is greater than or equal to a + 14. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

This gene is expressed primarily in hemangiopericytoma. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. vascular disorders such as stroke, aneuynsm, cardiac arrest, 
hemorrhage. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

30 the vascular system, expression of this gene at significantly higher or lower levels may 
be detected in certain tissues or cell types (e.g. circulatory system, cancerous and 
wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

35 healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 197 as residues: Cys-14 to Gly-23. Met-45 to Gly-51. 
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The tissue distribution of this gene solely in hemangiopericytoma indicates that 
polynucleotides and polypeptides corresponding to this gene are useful lor the treatment 
and or detection of vascular disorders including hemorrhaging, aneuyrism, stroke and 
cardiac arrest. Protein, as well as, antibodies directed against the protein may show 

5 utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:48 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

10 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 449 of SEQ ID NO:48, b is an integer of 15 to 
463, where both a and b correspond to the positions of nucleotide residues shown in 

15 SEQ ID NO:48, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

20 The translation product of this gene shares sequence homology with a serine 

protease which is thought to be important in regulating the availibility and action of 
proteins in vivo. 

This gene is expressed primarily in cerebellum. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. disorders of the central nervous system related to abnormal growth factor 
regulation, including neurodegenerative conditions such as Alzheimers disease and 
psychiatric illness such as Schizophrenia. Similarly, polypeptides and antibodies 
30 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the Central Nervous System, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. CNS, cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, 
35 plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expiession 
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level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 198 as residues: Ser-17 to Gln-22. 

The tissue distribution in neural tissue, combined with the homology to serine 
proteases indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for treating disorders of the central nervous system including neurodegenerative 
diseases and psychiatric disorders. Furthermore, expression of this gene product within 
cerebral tissue indicates that it may be involved in neuronal survival: synapse formation; 
conductance; neural differentiation, etc. Such involvement may impact many processes, 
such as learning and cognition. It may also be useful in the treatment of such 
neurodegenerative disorders as schizophrenia; ALS; or Alzheimer's. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:49 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 87 1 of SEQ ID NO:49, b is an integer of 1 5 to 
885, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:49, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

This gene is expressed primarily in CD34 depleted buffy coat. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, autoimmune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
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immune, developmental, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. 

The tissue distribution in CD34 depleted buffy coat tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treating 
disorders of the immune system including autoimmune diseases. Furthermore, 
expression of this gene product in CD34 depleted buffy coat indicates a role in the 

10 regulation of the proliferation; survival; differentiation; and/or activation of potentially 
all hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 

15 the gene or protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. Therefore 
it may be also used as an agent for immunological disorders including arthritis, asthma, 
immune deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
inflammatory bowel disease, sepsis, acne, and psoriasis. In addition, this gene product 

20 may have commercial utility in the expansion of stem cells and committed progenitors 
of various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

25 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 50 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

30 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 833 of SEQ ID NO: 50, b is an integer of 15 to 
847, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:50, and where b is greater than or equal to a + 14. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in B-cell lymphoma cells. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases of B-cell lineage including lymphomas lymphoblastic 
leukemias, myelomas and hairy cell leukemia. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
immune, cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 

15 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in immunce cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for for the treatment and or diagnosis 

20 of diseases of B-cell lineage including cancer. This factor may be useful in the terminal 
differentiation of malignant cells or may act as a growth factor for B-cell proliferation or 
differentiation. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

25 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:51 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

30 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 566 of SEQ ID NO:5 1 , b is an integer of 1 5 to 
580, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:51, and where b is greater than or equal to a + 14. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

This gene is expressed primarily in brain and CD34 depleted buffy coat. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. autoimmune disorders particularly those of the central nervous system 
such as multiple sclerosis. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

10 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g. immune, neural, and 
cancerous and wounded tissues) or bodily fluids (e.g. lymph, amniotic fluid, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 

15 from an individual having such a disorder, relative to the standard gene expression 

level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:201 as residues: Pro-35 to Ala-40. 

20 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating autoimmune disorders such as 
multiple sclerosis. Furthermore, expression of this gene product in CD34 depleted 
buffy coat indicates a role in the regulation of the proliferation; survival; differentiation; 
and/or activation of potentially all hematopoietic cell lineages, including blood stem 

25 cells. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). Since the gene is expressed 
in cells of lymphoid origin, the gene or protein, as well as, antibodies directed against 
the protein may show utility as a tumor marker and/or immunotherapy targets for the 

30 above listed tissues. Therefore it may be also used as an agent for immunological 
disorders including arthritis, asthma, immune deficiency diseases such as AIDS, 
leukemia, rheumatoid arthritis, inflammatory bowel disease, sepsis, acne, and 
psoriasis. In addition, this gene product may have commercial utility in the expansion 
of stem cells and committed progenitors of various blood lineages, and in the 

35 differentiation and/or proliferation of various cell types. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or immunotherapy 
targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:52 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 584 of SEQ ID NO:52, b is an integer of 15 to 
598, where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO:52, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

15 This gene is expressed primarily in tissues of the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurological and neurodegenerative disorders. Similarly, polypeptides 

20 and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the central nervous system, 
expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. CNS, brain, cancerous and wounded tissues) or bodily fluids 

25 (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in brain tissue indicates that polynucleotides and 

30 polypeptides corresponding to this gene are useful as a neuronal protective agent and as 
a growth factor for cells of the central or peripheral nervous system. Furthermore, 
expression of this gene product within the brain indicates that it may be involved in 
neuronal survival; synapse formation; conductance; neural differentiation, etc. Such 
involvement may impact many processes, such as learning and cognition. Protein, as 

35 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:53 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 557 of SEQ ID NO:53, b is an integer of 1 5 to 
57 1 , where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO:53, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

15 The gene encoding the disclosed cDNA is thought to reside on chromosome 9. 

Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 9. 

This gene is expressed primarily in embryo and fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

25 tissues or cells, particularly of the embryo and fetal tissues, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
fetal tissues, develpomental, cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, amniotic fluid, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

30 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in embryonic and fetal tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of developmental disorders. Furthermore, the tissue distribution indicates 

35 that polynucleotides and polypeptides corresponding to this gene are useful for the 

diagnosis and treatment of cancer and other proliferative disorders. Expression within 
embryonic tissue and other cellular sources marked by proliferating cells indicates that 
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this protein may play a role in the regulation of cellular division. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the abovelisted tissues. 

iMany polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 54 and may have been publicly available prior to conception of the present 

10 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b. where a is any integer between 1 to 1233 of SEQ ID NO:54, b is an integer of 15 

15 to 1247, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:54, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

20 

The gene encoding the disclosed cDNA is thought to reside on chromosome 2. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 2. 

This gene is expressed primarily in infant brain, placenta, seme immune tissues. 

25 and, to a lesser extent, in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental and immune disorders. Similarly, polypeptides and 

30 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the early developmental stage tissues and 
immune tissues, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. immune, cancerous and wounded tissues) 

35 or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
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the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:204 as residues: Val-32 to Met-39, Leu-44 to Val-49. 
5 The tissue distribution in fetal and immune tissues indicates that polynucleotides 

and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
developmental and immune disorders. Furthermore, the tissue distribution indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of cancer and other proliferative disorders. Expression within 

10 embryonic tissue and other cellular sources marked by proliferating cells indicates that 
this protein may play a role in the regulation of cellular division. Additionally, the 
expression in hematopoietic cells and tissues indicates that this protein may play a role 
in the proliferation, differentiation, and/or survival of hematopoietic ceii lineages. In 
such an event, this gene may be useful in the treatment of lymphoproliferative 

15 disorders, and in the maintenance and differentiation of various hematopoietic lineages 
from early hematopoietic stem and committed progenitor cells. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. The protein product of this 

20 gene is useful for the detection/treatment of neurodegenerative disease states, 
behavioural disorders, or inflamatory conditions such as Alzheimers Disease, 
Parkinsons Disease, Huntingtons Disease, Tourctte Syndrome, meningitis, 
encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 

25 hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 

disorder, panic disorder, learning disabilities, ALS, psychoses . autism, and altered 
bahaviors, including disorders in feeding, sleep patterns, balance, and preception. In 
addition, elevated expression of this gene product in regions of the brain indicates that it 
plays a role in normal neural function. Potentially, this gene product is involved in 

30 synapse formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Moreover, the gene or gene product may also play a role in 
the treatment and/or detection of developmental disorders associated with the 
developing embryo, sexually-linked disorders, or disorders of the cardiovascular 
system. Protein, as well as, antibodies directed against the protein may show utility as a 

35 tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
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ID NO:55 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
5 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 834 of SEQ ID NO:55. b is an integer of 15 to 
848, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:55, and where b is greater than or equal to a + 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

When tested against Jurkat T-cells, supernatants removed from cells containing 
this gene activated the GAS assay. Thus, it is likely that this gene activates T-cells 
15 through the Jak-STAT signal transduction pathway. The gamma activating sequence 

(GAS) is a promoter element found upstream of many genes which are involved in the 
Jak-STAT pathway. The Jak-STAT pathway is a large, signal transduction pathway 
involved in the differentiation and proliferation of cells. Therefore, activation of the Jak- 
STAT pathway, reflected by the binding of the GAS element, can be used to indicate 
20 proteins involved in the proliferation and differentiation of cells. 

This gene is expressed primarily in brain tissues, and to a lesser extent, in T- 

cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. neuronal disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the brain and immune system, expression of this gene at 

30 significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
immune, brain, cancerous and wounded tissues) or bodily fluids (e.g. lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 

35 having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:205 as residues: Ser-33 to Ser-44. 
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The tissue distribution in T-cells indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neuronal and 
immune system disorders. Furthermore, expression of this gene product in T-cells, as 
well as the observed biological activity of this gene product, indicates that this gene 
5 product may be involved in the regulation of cytokine production, antigen presentation, 
or other processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 
the gene or protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. Therefore 

10 it may be also used as an agent for immunological disorders including arthritis, asthma, 
immune deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
inflammatory bowel disease, sepsis, acne, and psoriasis. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed progenitors 
of various blood lineages, and in the differentiation and/or proliferation of various cell 

15 types. Alternatively, the expression within brain tissue indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states, behavioural disorders, or inflamatory conditions such 
as Alzheimers Disease, Parkinsons Disease, Huntingtons Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 

20 trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, panic disorder, learning disabilities, ALS, psychoses , autism, 
and altered bahaviors, including disorders in feeding, sleep patterns, balance, and 
preception. In addition, elevated expression of this gene product in regions of the brain 

25 indicates that it plays a role in normal neural function. Potentially, this gene product is 

involved in synapse formation, neurotransmission, learning, cognition, homeostasis, or 
neuronal differentiation or survival. Moreover, the gene or gene product may also play a 
role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, sexually-linked disorders, or disorders of the cardiovascular 

30 system. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:56 and may have been publicly available prior to conception of the present 

35 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
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polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 655 of SEQ ID NO:56, b is an integer of 15 to 
669, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:56, and where b is greater than or equal to a + 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

When tested against U937 Myeloid cell lines, supernatants removed from cells 

10 containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells through the Jak-STAT signal transduction pathway. The gamma 
activating sequence (GAS) is a promoter element found upstream of many genes which 
are involved in the Jak-STAT pathway. When tested against K562 leukemia cell lines, 
supernatants removed from cells containing this gene activated the ISRE assay. Thus, it 

15 is likely that this gene activates leukemia cells through the Jak-STAT signal transduction 
pathway. The interferon-sensitive response element is a promoter element found 
upstream of many genes which are involved in the Jak-STAT pathway. The Jak-STAT 
pathway is a large, signal transduction pathway involved in the differentiation and 
proliferation of cells. Therefore, activation of the Jak-STAT pathway, reflected by the 

20 binding of the GAS element, can be used to indicate proteins involved in the 

proliferation and differentiation of cells. Contact of cells with supernatant expressing 
the product of this gene increases the permeability of bovine chondrocytes to calcium. 
Thus, it is likely that the product of this gene is involved in a signal transduction 
pathway that is initiated when the product of this gene binds a receptor on the surface of 

25 the chondrocyte cells. Thus, polynucleotides and polypeptides have uses which 
include, but are not limited to, activating bone cells. 

This gene is expressed primarily in breast and placenta. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

30 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. pregnancy disorders including miscarriage. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the breast and placenta , expression of this 

35 gene at significantly higher or lower levels may be detected in certain tissues or cell 

types (e.g. placental tissues, breast, bone, cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
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tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in both placenta and breast indicates a role for this protein 

5 in the treatment and/or detection of miscarriages in suspect individuals, of birth defects, 
of breast cancer, and female infertility. Furthermore, the biological assay data strongly 
indicates that the translation product of this gene is actively involved in the initiation of 
several signal transduction pathways and the activation of several cell types. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

10 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:57 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

15 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 666 of SEQ ID NO:57, b is an integer of 15 to 
680, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:57, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

The gene encoding the disclosed cDN A is thought to reside on chromosome 1 1 . 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
25 analysis for chromosome 1 1 . One embodiment of this gene comprises the polypeptides 
of the following amino acid sequence: 

MWGQPRPVDSVWSSSIPKKSVESNDNKSHLHKREH (SEQ ID NO:362), 
MTTKAIFTKGNIDSLSFKSNMWSVYI (SEQ ID NO:363). An additional 
embodiment is the polynucleotides encoding these polypeptides. 

30 This gene is expressed primarily in the pancreas. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. pancreatic related disorders such as diabetes. Similarly, polypeptides and 

35 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or ceii type(s). For a number of disorders 
of the above tissues or cells, particularly of the endocrine system, expression of this 
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gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. pancreas, endocrine, metabolic, and cancerous and wounded tissues) or 
bodily fluids (e.g. lymph, serum, bile, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution of this gene in pancreatic tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment/detection of endocrine disorders and metabolic disorders associated with the 
pancreas including diabetes, pancreatitis, and pancreatic cancer. Furthermore, the tissue 
distribution indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the detection, treatment, and/or prevention of various endocrine disorders 
and cancers, particularly Addison's disease, Cushing's Syndrome, and disorders 
and/or cancers of the pancrease (e.g. diabetes mellitus). adrenal cortex, ovaries, 
pituitary (e.g.. hyper-, hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), 
parathyroid (e.g. hyper-, hypoparathyroidism) . hypothalamus, and testes. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:58 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 5 10 of SEQ ID NO:58. b is an integer of 15 to 
524, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:58, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

This gene is expressed primarily in chondrosarcoma tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, including diseases of the skeletal system, particularly with respect to the 
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cartilagenous structures and also cancer of these tissues. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the skeletal system, expression of this gene 
5 at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g. bone, connective, and cancerous and wounded tissues) or bodily fluids (e.g. 
lymph, scrum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

10 individual not having the disorder. 

The tissue distribution in chondrosarcoma tumors indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the treatment/diagnosis of 
cartilage disorders including arthritis and eancci. Protein, as well as, antibodies directed 
against the protein may show utility as a tissue-specific marker and/or immunotherapy 

15 target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:59 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

20 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 413 of SEQ ID NO:59, b is an integer of 15 to 
427, where both a and b correspond to the positions of nucleotide residues shown in 

25 SEQ ID NO:59, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

30 The translation product of this gene shares sequence homology with 

sorbinwhich is thought to be important in the manufacture of vitamin C. Additionally, 
sorbin is thought to be important in the process of stimulating water and electrolyte 
absorption in various cells in the body. Porcine Sorbin has activity in stimulating water 
and electrolyte absorption across mucosa. It has been pursued as a regulator of 

35 electrolyte absorption in the nasal and enteric mucosa. This gene was identified in 
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electrolyte absorption. 
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This gene is expressed primarily in human hypothalamus tissue from a patient 
suffering from Alzheimer's disease. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurologic disorders (eg. Alzheimer's disease). Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the central nervous system, 

10 expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. neural, and cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

15 individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:209 as residues: Leu-29 to Leu-37, Gln-65 to Asp-70, Gln-85 to Gly-95. 

The tissue distribution in neural tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 

20 Alzheimer's disease. Additionally, the translation product of this gene, based upon its 
homology to the porcine sorbin, could be useful for the detection and/or amelioration of 
disorders involving the CNS regulation of water or electrolyte absorption. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 60 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

30 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1249 of SEQ ID NO:60. b is an integer of 15 
to 1263, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:60, and where b is greater than or equal to a + 14. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in synovium, and to a lesser extent, in other 

tissues. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. synovial diseases such as synovial sarcoma. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
10 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the synovium, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
connective tissues, cancerous and wounded ussucm ui bodily fluids (e.g. lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
5 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in synovium indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
:0 synovial diseases such as arthritis. Furthermore, the expression of this gene product in 
synovium would suggest a role in the detection and treatment of disorders and 
conditions affecting the skeletal system, in particular osteoporosis as well as disorders 
afflicting connective tissues (e.g. trauma, tendonitis, chrondomalacia and 
inflammation), such as in the diagnosis or treatment of various autoimmune disorders 
5 such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well as 
dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (ie. spondyloepiphyseal dysplasia congenita, familial osteoarthritis. 
Atelosteogenesis type IL metaphyseal chondrodysplasia type Schmid). Protein, as well 
as. antibodies directed against the protein may show utility as a tumor marker and/or 
3 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:61 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preieraDiy excmucu uum mc yn-^m m»^i»w>« ^ 
polynucleotides comprising a nucleotide sequence described by the general formula of 
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a-b, where a is any integer between 1 to 706 of SEQ ID NO:61 , b is an integer of 15 to 
720, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:61, and where b is greater than or equal to a + 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene is expressed primarily in immune tissues and fast-growing tissues, 
such as tumor and early-stage developmental tissues, and, to a lesser extent, in some 

10 other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and growth related disorders. Similarly, polypeptides and 

15 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune tissues and fast-growing 
tissues, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. developmental, immune, cancerous and wounded 

20 tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 

25 NO:2 1 1 as residues: Ala-28 to Ala-47. 

The tissue distribution in immune tissues indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection and treatment of 
immune and growth related diorders. Furthermore, expression within embryonic tissue 
and other cellular sources marked by proliferating cells indicates that this protein may 

30 play a role in the regulation of cellular division, and may show utility in the diagnosis 
and treatment of cancer and other proliferative disorders. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 

35 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:62 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
5 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 575 of SEQ ID NO:62, b is an integer of 15 to 
589, where both a and b correspond to the positions of nucleotide residues shown in 
10 SEQ ID NO:62, and where b is greater than or equal to a + 14. 

FEATURES OF FKOTEIN ENCODED BY GENE NO: 53 

15 One embodiment of this gene comprises polypeptides of the following amino 

acid sequence: DSXLDRRPSGPDVKFLSNKHHFSMVC (SEQ ID NO:364). An 
additional embodiment is the polynucleotides encoding these polypeptides. 

This gene is expressed primarily in spleen, and to a lesser extent, in a range of 
hematopoetic cell types. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoeitic disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

25 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune and hematopoetic systems, 
expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. spleen, immune, cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 

30 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:212 as residues: Cys-25 to Trp-30. 

35 The tissue distribution of this gene in spleen tissue indicates that polynucleotides 

and polypeptides corresponding to this gene are useful for the treatment or detection of 
immune or hematopoietic disorders including arthritis, asthma, immunodeficiency 
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diseases and leukemia. Expression of this gene product in spleen indicates a role in the 
regulation of the proliferation; survival; differentiation; and/or activation of potentially 
all hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 
the gene or protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. Therefore 
it may be also used as an agent for immunological disorders including arthritis, asthma, 
immune deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
inflammatory bowel disease, sepsis, acne, and psoriasis. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed progenitors 
of various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:63 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b. where a is any integer between 1 to 672 of SEQ ID NO:63. b is an integer of 15 to 
686. where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:63. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

This gene is expressed primarily in human normal breast, and to a lesser extent, 
in dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. glandular problems involving cells of epithelial origin including breast 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the 
female endocrine system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g. breast, cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal 

5 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:213 as residues: Ser-32 to Asn-44. 

10 The tissue distribution in breast tissue indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for the diagnosis or treatment of both 
malignant and non-malignant problems of the breast tissues, including cancer. 
Alternatively, the expression in dendritic tissue indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 

15 hematopoetic related disorders such as anemia, pancytopenia, leukopenia, 

thrombocytopenia or leukemia since stromal cells are important in the production of 
cells of hematopoietic lineages. The uses include bone marrow cell ex vivo culture, 
bone marrow transplantation, bone marrow reconstitution, radiotherapy or 
chemotherapy of neoplasia. The gene product may also be involved in lymphopoiesis, 

20 therefore, it can be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. Protein, as well as, antibodies 
directed against the protein may show utility as a tissue-specific marker and/or 

25 immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 64 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

30 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 438 of SEQ ID NO:64, b is an integer of 15 to 
452, where both a and b correspond to the positions of nucleotide residues shown in 

35 SEQ ID NO:64, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 55 

When tested against U937 Myeloid cell lines, supernatants removed from cells 
5 containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells through the Jak-STAT signal transduction pathway. The gamma 
activating sequence (GAS) is a promoter element found upstream of many genes which 
are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, signal 
transduction pathway involved in the differentiation and proliferation of cells. 
10 Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 

element, can be used to indicate proteins involved in the proliferation and differentiation 
of cells. 

This gene is expressed primarily in early stage human tissues, immune tissues, 
and to a lesser extent, in other tissues such as prostate. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, development and immune related diseases. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

20 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune and early stage human tissues, 
expression of this gene at significantly higher or lower levels may be detected in certain 
tissues or cell types (e.g. immune, developmental, cancerous and wounded tissues) or 
bodily fluids (e.g. lymph, amniotic fluid, serum, plasma, urine, synovial fluid and 

25 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in embryonic and immune tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 

30 and treatment of developmental and immune related diseases. The biological activity 
data supports the assertion that the translation product of this gene is useful in the 
treatment and/or diagnosis of diseases related to the immune system. Protein, as well 
as, antibodies directed against the protein may show utility as a tissue-specific marker 
and/or immunotherapy target for the above listed tissues. 

35 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:65 and may have been publicly available prior to conception of the present 
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invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
5 a-b, where a is any integer between 1 to 356 of SEQ ID NO:65, b is an integer of 15 to 
370, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:65, and where b is greater than or equal to a + 14. 



10 FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

The translation product of this gene shares sequence homology with medicago 
sativa salt-inducible protein. 

This gene is expressed primarily in human chronic synovitis. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skeletal or rheumatoid disorders, particularly, chronic synovitis. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
skeletal system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g. connective tissues, cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal 

25 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:215 as residues: Lys-30 to Ser-44, Pro-77 to His-82. 

30 The tissue distribution in synovium indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for diagnosis of and as a therapeutic 
agent for chronic synovitis. In addition, the expression of this gene product in 
synovium would suggest a role in the detection and treatment of disorders and 
conditions affecting the skeletal system, in particular osteoporosis as well as disorders 

35 afflicting connective tissues (e.g. arthritis, trauma, tendonitis, chrondomalacia and 

inflammation), such as in the diagnosis or treatment of various autoimmune disorders 
such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well as 
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dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (ie. spondyloepiphyseal dysplasia congenita, familial osteoarthritis, 
Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). Protein, as well 
as, antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:66 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 973 of SEQ ID NO:66, b is an integer of 15 to 
987, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:66, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

The translation product of this gene shares high sequence homology with the rat 
and mouse peroxisomal membrane proteins [gil297437], which appears to play a crucial 
role in transporting proteins into the organelle. Some human genetic disorders involving 
peroxisome biogenesis, such as Zellweger syndrome, may be caused by genetic defects 
of the import machinery located in the peroxisomal membrane. When tested against 
fibroblast cell lines, supernatants removed from cells containing this gene activated the 
EGR1 assay. Thus, it is likely that this gene activates fibroblast cells through a signal 
transduction pathway. Early growth response 1 (EGR1) is a promoter associated with 
certain genes that induces various tissues and cell types upon activation, leading the 
cells to undergo differentiation and proliferation. 

This gene is expressed primarily in normal human liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases of the hepatic system. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the hepatic disorders and liver metabolic 
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diseases, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. liver, cancerous and wounded tissues) or bodily 
fluids (e.g. lymph, bile, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
5 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:216 as residues: Lys-57 to Ser-66. 

The tissue distribution in liver indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useful for diagnosis and treatment of diseases relating to 
the liver. Furthermore, the homology indicates that the translational product of this gene 
may be useful in the detection and treatment of a number of disorders resulting from the 
improper transport of proteins into the organelle due to defects in peroxisomal 
membrane proteins, such as Zellweger syndrome. Protein, as well as. antibodies 

15 directed against the protein may show utility as a tumor marker and/or immunotherapy 
targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 67 and may have been publicly available prior to conception of the present 

20 invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1004 of SEQ ID NO:67, b is an integer of 15 

25 to 1018, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:67, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 58 

30 

The gene encoding the disclosed cDNA is thought to reside on chromosome 4. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 4. 

This gene is expressed primarily in human fetal dura mater. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type( s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to, developmental or neurologic disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue( s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous system, expression of 
5 this gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g. brain, developmental, and cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, amniotic fluid, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

10 fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:217 as residues: Ala- 19 to Lys-34. 

The tissue distribution in neural tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 

15 neurological diseases. Furthermore, the tissue distribution indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the detection/treatment of 
neurodegenerative disease states and behavioural disorders such as Alzheimers Disease, 
Parkinsons Disease, Huntingtons Disease, Tourette Syndrome, schizophrenia, mania, 
dementia, paranoia, obsessive compulsive disorder, panic disorder, learning 

20 disabilities, ALS, psychoses , autism, and altered bahaviors, including disorders in 

feeding, sleep patterns, balance, and preception. In addition, the gene or gene product 
may also play a role in the treatment and/or detection of developmental disorders 
associated with the developing embryo, ancl/or sexually-linked disorders. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

25 and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:68 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 

30 scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 748 of SEQ ID NO:68, b is an integer of 15 to 
762, where both a and b correspond to the positions of nucleotide residues shown in 

35 SEQ ID NO:68, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

The gene encoding the disclosed cDNA is thought to reside on chromosome 16. 
5 Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 16. 

This gene is expressed primarily in T helper cell and human uterine cancer. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, relating to hemopoietic and uterus disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell typc(s). For a number of disorders 
of the above tissues or cells, particularly of the immune and female reproductive 

15 system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. immune, reproductive, and cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, amniotic fluid, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

20 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in T-helper cells and uterine tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of disorders relating to both the immune and female 
reproductive systems. Expression of this gene product in T-cells indicates a role in the 

25 regulation of the proliferation; survival: differentiation; and/or activation of potentially 
all hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). Since the gene is expressed in cells of lymphoid origin, 

30 the gene or protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. Therefore 
it may be also used as an agent for immunological disorders including arthritis, asthma, 
immune deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, 
inflammatory bowel disease, sepsis, acne, and psoriasis. In addition, this gene product 

35 may have commercial utility in the expansion of stem cells and committed progenitors 
of various blood lineages, and in the differentiation and/or proliferation of various cell 
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types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
5 ID NO:69 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
10 a-b, where a is any integer between 1 to 616 of SEQ ID NO:69, b is an integer of 15 to 
630, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:69, and where b is greater than or equal to a + 14. 



15 FEATURES OF PROTEIN ENCODED BY GENE NO: 60 



This gene is expressed primarily in human fetal epithelium, and to a lesser 
extent, in testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. developmental or reproductive disorders, in addition to diseases of the 
integumentary system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

25 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the diseases relating to the epithelium, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types (e.g. 
epithelium, testes, developmental, and cancerous and wounded tissues) or bodily fluids 
(e.g. lymph, serum, amniotic fluid, plasma, urine, synovial fluid and spinal fluid) or 

30 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in fetal epithelium and testes indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 

35 diagnosis and treatment of epithelium related diseases. In addition, polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment, diagnosis, and/or 
prevention of various skin disorders including congenital disorders (i.e. nevi. moles. 
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freckles, Mongolian spots, hemangiomas, port-wine syndrome), integumentary tumors 
(i.e. keratoses, Bowen's disease, basal cell carcinoma, squamous cell carcinoma, 
malignant melanoma, Pagef s disease, mycosis fungoides, and Kaposi's sarcoma), 
injuries and inflammation of the skin (i.e. wounds, rashes, prickly heat disorder, 
5 psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, autoimmune 
disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma, 
pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, purpura, and 
xanthelasma. Moreover, such disorders may predispose increased susceptibility to viral 
and bacterial infections of the skin (i.e. cold sores, warts, chickenpox, molluscum 

10 contagiosum. herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, 
and ringworm). Furthermore, the tissue distribution also indicates that the protein 
product of this gene is useful for the treatment and diagnosis of conditions concerning 
proper testicular function (e.g. endocrine function, sperm maturation), as well as 
cancer. Therefore, this gene product is useful in the treatment of male infertility and/or 

15 impotence. This gene product is also useful in assays designed to identify binding 

agents as such agents (antagonists) are useful as male contraceptive agents. Similarly, 
the protein is believed to by useful in the treatment and/or diagnosis of testicular cancer. 
The testes are also a site of active gene expression of transcripts that may be expressed, 
particularly at low levels, in other tissues of the body. Therefore, this gene product may 

20 be expressed in other specific tissues or organs where it may play related functional 
roles in other processes, such as hematopoiesis, inflammation, bone formation, and 
kidney function, to name a few possible target indications. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 

25 ID NO:70 and may have been publicly available prior to conception of the present 

invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 

30 a-b, where a is any integer between 1 to 926 of SEQ ID NO:70. b is an integer of 15 to 
940, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:70, and where b is greater than or equal to a + 14. 

35 

FEATURES OF PROTEIN ENCODED BY GENE NO: 61 
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When tested against both U937 Myeloid cell and Jurkat T-cell cell lines, 
supernatants removed from cells containing this gene activated the GAS assay. Thus, it 
is likely that this gene activates both T-cells and myeloid cells through the Jak-STAT 
5 signal transduction pathway. The gamma activating sequence (GAS) is a promoter 
element found upstream of many genes which are involved in the Jak-STAT pathway. 
The Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate proteins 
10 involved in the proliferation and differentiation of cells. 

This gene is expressed primarily in human adult lymph node and in early stage 
human lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders, lymphatitis and pulmonary disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system 

20 and respiratory system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g. immune, cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 

25 or bodily fluid from an individual not having the disorder. 

The tissue distribution in adult lymph indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases relating to the immune system and respiratory system. Furthermore, 
expression of this gene product in lymph nodes indicates a role in the regulation of the 

30 proliferation; survival; differentiation; and/or activation of potentially all hematopoietic 
cell lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). Since the gene is expressed in cells of lymphoid origin, the gene or protein, 

35 as well as, antibodies directed against the protein may show utility as a tumor marker 

and/or immunotherapy targets for the above listed tissues. Therefore it may be also used 
as an agent for immunological disorders including arthritis, asthma, immune deficiency 
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diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory bowel disease, 
sepsis, acne, and psoriasis. In addition, this gene product may have commercial utility 
in the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 

immunotherapy targets for the above listed tissues. The biological activity data supports 
the notion that the translation product of this gene is an activator of various cells of the 
immune system, and thus could play an important role in the activities of the immune 
system. 

10 Many polynucleotide sequences, such as EST sequences, are publicly available 

and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:71 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 

15 Accordingly, preferably excluded from the present invention are one or more 

polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1089 of SEQ ID NO:71, b is an integer of 15 
to 1 103, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:71, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This gene is expressed primarily in glioblastoma and anergic T-cell. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural and immune disorders, such as glioblastosis cerebri. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

30 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the CNS and immune 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g. immune, neural, and cancerous and wounded 
tissues) or bodily fluids (e.g. lymph, serum, plasma, urine, synovial fluid and spinal 

35 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 



BNSDOCiD <WO 9922243A1 I > 



WO 99/22243 



90 



PCT/US98/22376 



The tissue distribution in T-cells indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of disorders relating to 
the CNS and the immune system. Furthermore, expression of this gene product in T- 
cells indicates a role in the regulation of the proliferation; survival; differentiation; 
5 and/or activation of potentially all hematopoietic cell lineages, including blood stem 
cells. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses ). Since the gene is expressed 
in cells of lymphoid origin, the gene or protein, as well as, antibodies directed against 

10 the protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. Therefore it may be also used as an agent for immunological 
disorders including arthritis, asthma, immune deficiency diseases such as AIDS, 
leukemia, rheumatoid arthritis, inflammatory bowel disease, sepsis, acne, and 
psoriasis. In addition, this gene product may have commercial utility in the expansion 

15 of stem cells and committed progenitors of various blood lineages, and in the 

differentiation and/or proliferation of various cell types. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or immunotherapy 
targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 

20 and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:72 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 

25 polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 885 of SEQ ID NO:72. b is an integer of 15 to 
899, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:72, and where b is greater than or equal to a + 14. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 



One embodiment of this gene comprises polypeptides of the following amino 
acid sequence: 

35 CLAEAVSVIQSIPIFNETGRFSFTLPYPVKIKVRFSFFLQIYLIMIFLGLYINFRHLY 
KQRRRRYGQKKKRSTKKKDLDGFLPV (SEQ ID NO:365). An additional 
embodiment is the polynucleotides encoding these polypeptides. 
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